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METHODS FOR IDENTIFYING RISK OF OSTEOARTHRITIS AND 
TREATMENTS THEREOF 

Field of the Invention 

[0001] The invention relates to genetic methods for identifying risk of osteoarthritis and treatments 
that specifically target such diseases. 

Background 

[0002] Osteoarthritis (OA) is a chronic disease usually affecting weight-bearing synovial joints. 
There are approximately 20 million Americans affected by OA and it is the leading cause of disability in 
the United States. In addition to extensive human suffering, OA also accounts for nearly all knee 
replacements and more than half of all hip replacements in the United States. Despite its prevalence, OA 
is poorly understood and there are few treatments available besides anti-inflammatory drugs and joint 
replacement. 

[0003] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

[0004] OA is characterized by the breakdown of cartilage in joints. Cartilage in joints cushions the 
ends of bones, and cartilage breakdown causes bones to rub against each other, causing pain and loss of 
movement. Type II collagen is the main component of cartilage, comprising 15-25% of the wet weight, 
approximately half the dry weight, and representing 90-95% of the total collagen content in the tissue. It 
forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 32:241-246 (1989)). 

Summary 

[0005] It has been discovered that certain polymorphic variations in human genomic DNA are 
associated with osteoarthritis. In particular, polymorphic variants in a locus containing a TM7SF3 region 
in human genomic DNA have been associated with risk of osteoarthritis. 

[0006] Thus, featured herein are methods for identifying a subject at risk of osteoarthritis and/or a 
risk of osteoarthritis in a subject, which comprise detecting the presence or absence of one or more 
polymorphic variations associated with osteoarthritis in or around the loci described herein in a human 
nucleic acid sample. In an embodiment, two or more polymorphic variations are detected in two or more 
regions of which one is the TM7SF3 region. In certain embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 1 1, 
12, 13, 14, 15, 16, 17, 18, 1 9 or 20 or more polymorphic variants are detected. 
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[0007] Also featured are nucleic acids that include one or more polymorphic variations associated 
with occurrence of osteoarthritis, as well as polypeptides encoded by these nucleic acids. In addition, 
provided are methods for identifying candidate therapeutic molecules for treating osteoarthritis, as well 
as methods for treating osteoarthritis in a subject by identifying a subject at risk of osteoarthritis and 
treating the subject with a suitable prophylactic, treatment or therapeutic molecule. 

[0008] Also provided are compositions comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and/or a TM7SF3 nucleic acid, with a RNAi, siRNA, antisense DNA or RNA, or 
ribozyme nucleic acid designed from a TM7SF3 nucleotide sequence. In an embodiment, the RNAi, 
siRNA, antisense DNA or RNA, or ribozyme nucleic acid is designed from a TM7SF3 nucleotide 
sequence that includes one or more polymorphic variations associated with osteoarthritis, and in some 
instances, specifically interacts with such a nucleotide sequence. Further, provided are arrays of nucleic 
acids bound to a solid surface, in which one or more nucleic acid molecules of the array have a TM7SF3 
nucleotide sequence, or a fragment or substantially identical nucleic acid thereof, or a complementary 
nucleic acid of the foregoing. Featured also are compositions comprising a cell from a subject having 
osteoarthritis or at risk of osteoarthritis and/or a TM7SF3 polypeptide, with an antibody that specifically 
binds to the polypeptide. In an embodiment, the antibody specifically binds to an epitope in the 
polypeptide that includes a non-synonymous amino acid modification associated with osteoarthritis (e.g., 
results in an amino acid substitution in the encoded polypeptide associated with osteoarthritis). 

Brief Description of the Drawings 

[0009] Figure 1 shows proximal SNPs in a TM7SF3 region in genomic DNA. The position of each 
SNP in the chromosome is shown on the x-axis and the y-axis provides the negative logarithm of the p- 
value comparing the estimated allele to that of the control group. Also shown in the figure are exons and 
introns of the gene region in the approximate chromosomal positions. 

Detailed Description 

[0010] It has been discovered that a polymorphic variant in a locus containing a TM7SF3 region is 
associated with occurrence of osteoarthritis in subjects. Thus, detecting genetic determinants associated 
with an increased risk of osteoarthritis occurrence can lead to early identification of a predisposition to 
osteoarthritis and early prescription of preventative measures. Also, associating a TM7SF3 polymorphic 
variant with osteoarthritis has provided new targets for screening molecules useful in treatments of 
osteoarthritis. 
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Osteoarthritis and Sample Selection 

[0011] Osteoarthritis (OA), or degenerative joint disease, is one of the oldest and most common 
types of arthritis. It is characterized by the breakdown of the joint's cartilage. Cartilage is the part of the 
joint that cushions the ends of bones, and its breakdown causes bones to rub against each other, causing 
pain and loss of movement. Type II collagen is the main component of cartilage, comprising 15-25% of 
the wet weight, approximately half the dry weight, and representing 90-95% of the total collagen content 
in the tissue. It forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 
32:241-246(1989)). 

[0012] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

[0013] Osteoarthritis affects an estimated 20.7 million Americans, mostly after age 45, with women 
more commonly affected than men. Physicians make a diagnosis of OA based on a physical exam and 
history of symptoms. X-rays are used to confirm diagnosis. Most people over 60 reflect the disease on 
X-ray, and about one-third have actual symptoms. 

[0014] There are many factors that can cause OA. Obesity may lead to osteoarthritis of the knees. 
In addition, people with joint injuries due to sports, work-related activity or accidents may be at increased 
risk of developing OA. 

[0015] Genetics has a role in the development of OA. Some people may be born with defective 
cartilage or with slight defects in the way that joints fit together. As a person ages, these defects may 
cause early cartilage breakdown in the joint or the inability to repair damaged or deteriorated cartilage in 
the joint. 

[0016] Inclusion or exclusion of samples for an osteoarthritis pool may be based upon the following 
criteria: ethnicity (e.g., samples derived from an individual characterized as Caucasian); parental 
ethnicity (e.g., samples derived from an individual of British paternal and maternal descent); relevant 
phenotype information for the individual (e.g., case samples derived from individuals diagnosed with 
specific knee osteoarthritis (OA) and were recruited from an OA knee replacement clinic). Control 
samples may be selected based on relevant phenotype information for the individual (e.g., derived from 
individuals free of OA at several sites (knee, hand, hip etc)); and no family history of OA and/or 
rheumatoid arthritis. Additional phenotype information collected for both cases and controls may include 
age of the individual, gender, family history of OA, diagnosis with osteoarthritis (joint location of OA, 
date of primary diagnosis, age of individual as of primary diagnosis), knee history (current symptoms, 
any major knee injury, menisectomy, knee replacement surgery, age of surgery), HRT history, 
osteoporosis diagnosis. 
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[0017] Based in part upon selection criteria set forth above, individuals having osteoarthritis can be 
selected for genetic studies. Also, individuals having no history of osteoarthritis often are selected for 
genetic studies, as described hereafter. 

Polymorphic Variants Associated with Osteoarthritis 

[0018] A genetic analysis provided herein linked osteoarthritis with polymorphic variant nucleic 
acid sequences in the human genome. As used herein, the term "polymorphic site" refers to a region in a 
nucleic acid at which two or more alternative nucleotide sequences are observed in a significant number 
of nucleic acid samples from a population of individuals. A polymorphic site may be a nucleotide 
sequence of two or more nucleotides, an inserted nucleotide or nucleotide sequence, a deleted nucleotide 
or nucleotide sequence, or a microsatellite, for example. A polymorphic site that is two or more 
nucleotides in length may be 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more, 20 or more, 30 or more, 50 
or more, 75 or more, 100 or more, 500 or more, or about 1000 nucleotides in length, where all or some of 
the nucleotide sequences differ within the region. A polymorphic site is often one nucleotide in length, 
which is referred to herein as a "single nucleotide polymorphism" or a "SNP." 

[0019] Where there are two, three, or four alternative nucleotide sequences at a polymorphic site, 
each nucleotide sequence is referred to as a "polymorphic variant" or "nucleic acid variant." Where two 
polymorphic variants exist, for example, the polymorphic variant represented in a minority of samples 
from a population is sometimes referred to as a "minor allele" and the polymorphic variant that is more 
prevalently represented is sometimes referred to as a "major allele." Many organisms possess a copy of 
each chromosome (e.g., humans), and those individuals who possess two major alleles or two minor 
alleles are often referred to as being "homozygous" with respect to the polymorphism, and those 
individuals who possess one major allele and one minor allele are normally referred to as being 
"heterozygous" with respect to the polymorphism. Individuals who are homozygous with respect to one 
allele are sometimes predisposed to a different phenotype as compared to individuals who are 
heterozygous or homozygous with respect to another allele. 

[0020] In genetic analysis that associate polymorphic variants with osteoarthritis, samples from 
individuals having osteoarthritis and individuals not having osteoarthritis often are allelotyped and/or 
genotyped. The term "allelotype" as used herein refers to a process for determining the allele frequency 
for a polymorphic variant in pooled DNA samples from cases and controls. By pooling DNA from each 
group, an allele frequency for each SNP in each group is calculated. These allele frequencies are then 
compared to one another. The term "genotyped" as used herein refers to a process for determining a 
genotype of one or more individuals, where a "genotype" is a representation of one or more polymorphic 
variants in a population. 
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[0021] A genotype or polymorphic variant may be expressed in terms of a "haplotype," which as 
used herein refers to two or more polymorphic variants occurring within genomic DNA in a group of 
individuals within a population. For example, two SNPs may exist within a gene where each SNP 
position includes a cytosine variation and an adenine variation. Certain individuals in a population may 
carry one allele (heterozygous) or two alleles (homozygous) having the gene with a cytosine at each SNP 
position. As the two cytosines corresponding to each SNP in the gene travel together on one or both 
alleles in these individuals, the individuals can be characterized as having a cytosine/cytosine haplotype 
with respect to the two SNPs in the gene. 

[0022] As used herein, the term "phenotype" refers to a trait which can be compared between 
individuals, such as presence or absence of a condition, a visually observable difference in appearance 
between individuals, metabolic variations, physiological variations, variations in the function of 
biological molecules, and the like. An example of a phenotype is occurrence of osteoarthritis. 

[0023] Researchers sometimes report a polymorphic variant in a database without determining 
whether the variant is represented in a significant fraction of a population. Because a subset of these 
reported polymorphic variants are not represented in a statistically significant portion of the population, 
some of them are sequencing errors and/or not biologically relevant. Thus, it is often not known whether 
a reported polymorphic variant is statistically significant or biologically relevant until the presence of the 
variant is detected in a population of individuals and the frequency of the variant is determined. Methods 
for detecting a polymorphic variant in a population are described herein, specifically in Example 2. A 
polymorphic variant is statistically significant and often biologically relevant if it is represented in 5% or 
more of a population, sometimes 10% or more, 15% or more, or 20% or more of a population, and often 
25% or more, 30% or more, 35% or more, 40% or more, 45% or more, or 50% or more of a population. 

[0024] A polymorphic variant may be detected on either or both strands of a double-stranded nucleic 
acid. Also, a polymorphic variant may be located within an intron or exon of a gene or within a portion 
of a regulatory region such as a promoter, a 5 ' untranslated region (UTR), a 3 ' UTR, and in DNA (e.g. , 
genomic DNA (gDNA) and complementary DNA (cDNA)), RNA (e.g., mRNA, tRNA, and rRNA), or a 
polypeptide. Polymorphic variations may or may not result in detectable differences in gene expression, 
polypeptide structure, or polypeptide function. 

[0025] It was determined that polymorphic variations associated with an increased risk of 
osteoarthritis existed in a TM7SF3 region in SEQ ID NO: 1 . In certain embodiments, a polymorphic 
variant at position rs 1484086 in the human genome was associated with an increased risk of 
osteoarthritis, and in a specific embodiment, a thymine at position rs 1484086 was associated with an 
increased risk of osteoarthritis. 

[0026] Polymorphic variants in and around the TM7SF3 locus were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 1 selected from the group 
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consisting of 230, 231, 5330, 6334, 11372, 11456, 11501, 13393, 16666, 17596, 19710, 19800, 20297, 
20967, 32514, 33159, 37600, 41259, 41329, 50060, 53292, 53393, 56417, 56435, 58847, 59595, 59661, 
60355, 60407, 62357, 68230, 68516, 69055, 72603, 73928, 85897 and 91554. Polymorphic variants at 
the following positions in SEQ ID NO: 1 in particular were associated with an increased risk of 
osteoarthritis: 56435, 59595, 53292, 33159 and 41329, with specific embodiments directed to positions 
56435 and/or 59595. In particular, the following polymorphic variants in SEQ ID NO: 1 were associated 
with risk of osteoarthritis: a thymine thymine repeat at position 56435, a thymine at position 59595, a 
cytosine at position 53292, a guanine at position 33159 and a thymine at position 41329. 

[0027] Based in part upon analyses summarized in Figure 1 , a region with significant association 
has been identified in a locus associated with osteoarthritis. Any polymorphic variants associated with 
osteoarthritis in a region of significant association can be utilized for embodiments described herein. For 
example, polymorphic variants in a region approximately 14,000 nucleotides in length spanning 
chromosome positions 27052000 to 27066000 in a TM7SF3 locus have significant association 
(chromosome positions are within NCBI's Genome build 34). 

Additional Polymorphic Variants Associated with Osteoarthritis 

[0028] Also provided is a method for identifying polymorphic variants proximal to an incident, 
founder polymorphic variant associated with osteoarthritis. Thus, featured herein are methods for 
identifying a polymorphic variation associated with osteoarthritis that is proximal to an incident 
polymorphic variation associated with osteoarthritis, which comprises identifying a polymorphic variant 
proximal to the incident polymorphic variant associated with osteoarthritis, where the incident 
polymorphic variant is in a TM7SF3 nucleotide sequence. The nucleotide sequence often comprises a 
polynucleotide sequence selected from the group consisting of (a) a polynucleotide sequence of SEQ ID 
NO: 1-2; (b) a polynucleotide sequence that encodes a polypeptide having an amino acid sequence 
encoded by a polynucleotide sequence of SEQ ID NO: 1-2; and (c) a polynucleotide sequence that 
encodes a polypeptide having an amino acid sequence that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-2 or a polynucleotide sequence 90% or 
more identical to the polynucleotide sequence of SEQ ID NO: 1 -2. The presence or absence of an 
association of the proximal polymorphic variant with osteoarthritis then is determined using a known 
association method, such as a method described in the Examples hereafter. In an embodiment, the 
incident polymorphic variant is a polymorphic variant associated with osteoarthritis described herein. In 
another embodiment, the proximal polymorphic variant identified sometimes is a publicly disclosed 
polymorphic variant, which for example, sometimes is published in a publicly available database. In 
other embodiments, the polymorphic variant identified is not publicly disclosed and is discovered using a 
known method, including, but not limited to, sequencing a region surrounding the incident polymorphic 
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variant in a group of nucleic samples. Thus, multiple polymorphic variants proximal to an incident 
polymorphic variant are associated with osteoarthritis using this method. 

[0029] The proximal polymorphic variant often is identified in a region surrounding the incident 
polymorphic variant. In certain embodiments, this surrounding region is about 50 kb flanking the first 
polymorphic variant (e.g. about 50 kb 5' of the first polymorphic variant and about 50 kb 3' of the first 
polymorphic variant), and the region sometimes is composed of shorter flanking sequences, such as 
flanking sequences of about 40 kb, about 30 kb, about 25 kb, about 20 kb, about 15 kb, about 10 kb, 
about 7 kb, about 5 kb, or about 2 kb 5' and 3' of the incident polymorphic variant. In other 
embodiments, the region is composed of longer flanking sequences, such as flanking sequences of about 
55 kb, about 60 kb, about 65 kb, about 70 kb, about 75 kb, about 80 kb, about 85 kb, about 90 kb, about 
95 kb, or about 100 kb 5' and 3' of the incident polymorphic variant. 

[0030] In certain embodiments, polymorphic variants associated with osteoarthritis are identified 
iteratively. For example, a first proximal polymorphic variant is associated with osteoarthritis using the 
methods described above and then another polymorphic variant proximal to the first proximal 
polymorphic variant is identified (e.g., publicly disclosed or discovered) and the presence or absence of 
an association of one or more other polymorphic variants proximal to the first proximal polymorphic 
variant with osteoarthritis is determined. 

[0031] The methods described herein are useful for identifying or discovering additional 
polymorphic variants that may be used to further characterize a gene, region or loci associated with a 
condition, a disease (e.g. , osteoarthritis), or a disorder. For example, allelotyping or genotyping data 
from the additional polymorphic variants may be used to identify a functional mutation or a region of 
linkage disequilibrium. In certain embodiments, polymorphic variants identified or discovered within a 
region comprising the first polymorphic variant associated with osteoarthritis are genotyped using the 
genetic methods and sample selection techniques described herein, and it can be determined whether 
those polymorphic variants are in linkage disequilibrium with the first polymorphic variant. The size of 
the region in linkage disequilibrium with the first polymorphic variant also can be assessed using these 
genotyping methods. Thus, provided herein are methods for deteimining whether a polymorphic variant 
is in linkage disequilibrium with a first polymorphic variant associated with osteoarthritis, and such 
information can be used in prognosis/diagnosis methods described herein. 

Isolated Nucleic Acids 

[0032] Featured herein are isolated TM7SF3 nucleic acid variants depicted in SEQ ID NO: 1-2, and 
substantially identical nucleic acids thereof. A nucleic acid variant may be represented on one or both 
strands in a double-stranded nucleic acid or on one chromosomal complement (heterozygous) or both 
chromosomal complements (homozygous). 
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[0033] As used herein, the term "nucleic acid" includes DNA molecules (e.g. , a complementary 
DNA (cDNA) and genomic DNA (gDNA)) and RNA molecules (e.g., mRNA, rRNA, siRNA and tRNA) 
and analogs of DNA or RNA, for example, by use of nucleotide analogs. The nucleic acid molecule can 
be single-stranded and it is often double-stranded. The term "isolated or purified nucleic acid" refers to 
nucleic acids that are separated from other nucleic acids present in the natural source of the nucleic acid. 
For example, with regard to genomic DNA, the term "isolated" includes nucleic acids which are 
separated from the chromosome with which the genomic DNA is naturally associated. An "isolated" 
nucleic acid is often free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and/or 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' nucleotide sequences which flank 
the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant techniques, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. As used herein, the term "gene" refers to a 
nucleotide sequence that encodes a polypeptide. 

[0034] Also included herein are nucleic acid fragments. These fragments often have a nucleotide 
sequence identical to a nucleotide sequence of SEQ ID NO: 1 -2, a nucleotide sequence substantially 
identical to a nucleotide sequence of SEQ ID NO: 1-2, or a nucleotide sequence that is complementary to 
the foregoing. The nucleic acid fragment may be identical, substantially identical or homologous to a 
nucleotide sequence in an exon or an intron in a nucleotide sequence of SEQ ID NO: 1-2, and may 
encode a domain or part of a domain of a polypeptide. Sometimes, the fragment will comprises one or 
more of the polymorphic variations described herein as being associated with osteoarthritis. The nucleic 
acid fragment is often 50, 100, or 200 or fewer base pairs in length, and is sometimes about 300, 400, 500, 
600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500, 2000, 3000, 4000, 5000, 10000, 15000, or 20000 
base pairs in length. A nucleic acid fragment that is complementary to a nucleotide sequence identical or 
substantially identical to a nucleotide sequence in SEQ ID NO: 1-2 and hybridizes to such a nucleotide 
sequence under stringent conditions is often referred to as a "probe." Nucleic acid fragments often include 
one or more polymorphic sites, or sometimes have an end that is adjacent to a polymorphic site as described 
hereafter. 

[0035] An example of a nucleic acid fragment is an oligonucleotide. As used herein, the term 
"oligonucleotide" refers to a nucleic acid comprising about 8 to about 50 covalently linked nucleotides, 
often comprising from about 8 to about 35 nucleotides, and more often from about 10 to about 25 
nucleotides. The backbone and nucleotides within an oligonucleotide may be the same as those of 
naturally occurring nucleic acids, or analogs or derivatives of naturally occurring nucleic acids, provided 
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that oligonucleotides having such analogs or derivatives retain the ability to hybridize specifically to a 
nucleic acid comprising a targeted polymorphism. Oligonucleotides described herein may be used as 
hybridization probes or as components of prognostic or diagnostic assays, for example, as described 
herein. 

[0036] Oligonucleotides are typically synthesized using standard methods and equipment, such as 
the ABI™3900 High Throughput DNA Synthesizer and the EXPEDITE™ 8909 Nucleic Acid 
Synthesizer, both of which are available from Applied Biosystems (Foster City, CA). Analogs and 
derivatives are exemplified in U.S. Pat. Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 
5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; 
WO 00/56746; WO 01/14398, and related publications. Methods for synthesizing oligonucleotides 
comprising such analogs or derivatives are disclosed, for example, in the patent publications cited above 
and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,1 17,992; in WO 00/75372; and in 
related publications. 

[0037] Oligonucleotides may also be linked to a second moiety. The second moiety may be an 
additional nucleotide sequence such as a tail sequence (e.g., a polyadenosine tail), an adapter sequence 
(e.g., phage M13 universal tail sequence), and others. Alternatively, the second moiety may be a non- 
nucleotide moiety such as a moiety which facilitates linkage to a solid support or a label to facilitate 
detection of the oligonucleotide. Such labels include, without limitation, a radioactive label, a 
fluorescent label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety may 
be attached to any position of the oligonucleotide, provided the oligonucleotide can hybridize to the 
nucleic acid comprising the polymorphism. 

Uses for Nucleic Acid Sequence 

[0038] Nucleic acid coding sequences may be used for diagnostic purposes for detection and control 
of polypeptide expression. Also, included herein are oligonucleotide sequences such as antisense RNA, 
small-interfering RNA (siRNA) and DNA molecules and ribozymes that function to inhibit translation of 
a polypeptide. Antisense techniques and RNA interference techniques are known in the art and are 
described herein. 

[0039] Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
hammerhead motif ribozyme molecules may be engineered that specifically and efficiently catalyze 
endonucleolytic cleavage of RNA sequences corresponding to or complementary to TM7SF3 nucleotide 
sequences. Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, 

9 



PATENT 
Docket 524593009100 



GUU and GUC. Once identified, short RNA sequences of between fifteen (15) and twenty (20) 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may be 
evaluated for predicted structural features such as secondary structure that may render the oligonucleotide 
sequence unsuitable. The suitability of candidate targets may also be evaluated by testing their 
accessibility to hybridization with complementary oligonucleotides, using ribonuclease protection assays. 

[0040] Antisense RNA and DNA molecules, siRNA and ribozymes may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically 
synthesizing oligodeoxyribonucleotides well known in the art such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription 
of DNA sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase promoters such as the T7 or 
SP6 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell lines. 

[0041] DNA encoding a polypeptide also may have a number of uses for the diagnosis of diseases, 
including osteoarthritis, resulting from aberrant expression of a target gene described herein. For 
example, the nucleic acid sequence may be used in hybridization assays of biopsies or autopsies to 
diagnose abnormalities of expression or function (e.g., Southern or Northern blot analysis, in situ 
hybridization assays). 

[0042] In addition, the expression of a polypeptide during embryonic development may also be 
determined using nucleic acid encoding the polypeptide. As addressed, infra, production of functionally 
impaired polypeptide is the cause of various disease states, such as osteoarthritis. In situ hybridizations 
using polypeptide as a probe may be employed to predict problems related to osteoarthritis. Further, as 
indicated, infra, administration of human active polypeptide, recombinantly produced as described 
herein, may be used to treat disease states related to functionally impaired polypeptide. Alternatively, 
gene therapy approaches may be employed to remedy deficiencies of functional polypeptide or to replace 
or compete with dysfunctional polypeptide. 

Expression Vectors. Host Cells, and Genetically Engineered Cells 

[0043] Provided herein are nucleic acid vectors, often expression vectors, which contain a TM7SF3 
nucleotide sequence, or a substantially identical sequence thereof. As used herein, the term "vector" 
refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked 
and can include a plasmid, cosmid, or viral vector. The vector can be capable of autonomous replication 
or it can integrate into a host DNA. Viral vectors may include replication defective retroviruses, 
adenoviruses and adeno-associated viruses for example. 
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[0044] A vector can include a TM7SF3 nucleotide sequence in a form suitable for expression of an 
encoded target polypeptide or target nucleic acid in a host cell. A "target polypeptide" is a polypeptide 
encoded by a TM7SF3 nucleotide sequence, or a substantially identical nucleotide sequence thereof. The 
recombinant expression vector typically includes one or more regulatory sequences operatively linked to 
the nucleic acid sequence to be expressed. The term "regulatory sequence" includes promoters, 
enhancers and other expression control elements (e.g., polyadenylation signals). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence, as well as tissue-specific 
regulatory and/or inducible sequences. The design of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level of expression of polypeptide desired, and the like. 
Expression vectors can be introduced into host cells to produce target polypeptides, including fusion 
polypeptides. 

[0045] Recombinant expression vectors can be designed for expression of target polypeptides in 
prokaryotic or eukaryotic cells. For example, target polypeptides can be expressed in E. coli, insect cells 
(e.g., using baculovirus expression vectors), yeast cells, or mammalian cells. Suitable host cells are 
discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

[0046] Expression of polypeptides in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or non-fusion 
polypeptides. Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to 
the amino terminus of the recombinant polypeptide. Such fusion vectors typically serve three purposes: 
1) to increase expression of recombinant polypeptide; 2) to increase the solubility of the recombinant 
polypeptide; and 3) to aid in the purification of the recombinant polypeptide by acting as a ligand in 
affinity purification. Often, a proteolytic cleavage site is introduced at the junction of the fusion moiety 
and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fusion 
moiety subsequent to purification of the fusion polypeptide. Such enzymes, and their cognate 
recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 
include pGEX (Pharmacia Biotech Inc; Smith & Johnson, Gene 67: 31-40 (1988)), pMAL (New England 
Biolabs, Beverly, MA) and pRJT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase 
(GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant 
polypeptide. 

[0047] Purified fusion polypeptides can be used in screening assays and to generate antibodies 
specific for target polypeptides. In a therapeutic embodiment, fusion polypeptide expressed in a 
retroviral expression vector is used to infect bone marrow cells that are subsequently transplanted into 
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irradiated recipients. The pathology of the subject recipient is then examined after sufficient time has 
passed (e.g., six (6) weeks). 

[0048] Expressing the polypeptide in host bacteria with an impaired capacity to proteolytically 
cleave the recombinant polypeptide is often used to maximize recombinant polypeptide expression 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology, Academic Press, San Diego, 
California 185: 1 19-128 (1990)). Another strategy is to alter the nucleotide sequence of the nucleic acid 
to be inserted into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et ai, Nucleic Acids Res. 20: 2111-2118 (1992)). Such alteration 
of nucleotide sequences can be carried out by standard DNA synthesis techniques. 

[0049] When used in mammalian cells, the expression vector's control functions are often provided 
by viral regulatory elements. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. Recombinant mammalian expression vectors are 
often capable of directing expression of the nucleic acid in a particular cell type (e.g., tissue-specific 
regulatory elements are used to express the nucleic acid). Non-limiting examples of suitable tissue- 
specific promoters include an albumin promoter (liver-specific; Pinkert et ai, Genes Dev. 1: 268-277 
(1987)), lymphoid-specific promoters (Calame & Eaton, Adv. Immunol. 43: 235-275 (1988)), promoters 
of T cell receptors (Winoto & Baltimore, EMBO J. 8: 729-733 (1989)) promoters of immunoglobulins 
(Banerji et al, Cell 33: 729-740 (1983); Queen & Baltimore, Cell 33: 741-748 (1983)), neuron-specific 
promoters (e.g., the neurofilament promoter; Byrne & Ruddle, Proc. Natl. Acad. Sci. USA 86: 5473- 
5477 (1989)), pancreas-specific promoters (Edlund et al. Science 230: 912-916 (1985)), and mammary 
gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are sometimes utilized, for 
example, the murine hox promoters (Kessel & Gruss, Science 249: 374-379 (1990)) and the o> 
fetopolypeptide promoter (Campes & Tilghman, Genes Dev. 3: 537-546 (1989)). 

[0050] A TM7SF3 nucleic acid also may be cloned into an expression vector in an antisense 
orientation. Regulatory sequences (e.g., viral promoters and/or enhancers) operatively linked to a 
TM7SF3 nucleic acid cloned in the antisense orientation can be chosen for directing constitutive, tissue 
specific or cell type specific expression of antisense RNA in a variety of cell types. Antisense expression 
vectors can be in the form of a recombinant plasmid, phagemid or attenuated virus. For a discussion of 
the regulation of gene expression using antisense genes see, e.g., Weintraub et al., Antisense RNA as a 
molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) (1986). 

[0051] Also provided herein are host cells that include a TM7SF3 nucleotide sequence within a 
recombinant expression vector or a fragment of such a nucleotide sequence which facilitate homologous 
recombination into a specific site of the host cell genome. The terms "host cell" and "recombinant host 
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cell" are used interchangeably herein. Such terms refer not only to the particular subject cell but rather 
also to the progeny or potential progeny of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term as used herein. A 
host cell can be any prokaryotic or eukaryotic cell. For example, a target polypeptide can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells). Other suitable host cells are known to those skilled in the art. 

[0052] Vectors can be introduced into host cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, transduction/infection, DEAE-dextran- 
mediated transfection, lipofection, or electroporation. 

[0053] A host cell provided herein can be used to produce (i.e., express) a target polypeptide or a 
substantially identical polypeptide thereof. Accordingly, further provided are methods for producing a 
target polypeptide using host cells described herein. In one embodiment, the method includes culturing 
host cells into which a recombinant expression vector encoding a target polypeptide has been introduced 
in a suitable medium such that a target polypeptide is produced. In another embodiment, the method 
further includes isolating a target polypeptide from the medium or the host cell. 

[0054] Also provided are cells or purified preparations of cells which include a TM7SF3 transgene, 
or which otherwise misexpress target polypeptide. Cell preparations can consist of human or non-human 
cells, e.g. , rodent cells, e.g. , mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the 
cell or cells include a TM7SF3 transgene (e.g. , a heterologous form of a TM7SF3 gene, such as a human 
gene expressed in non-human cells). The transgene can be misexpressed, e.g. , overexpressed or 
underexpressed. In other preferred embodiments, the cell or cells include a gene which misexpress an 
endogenous target polypeptide (e.g., expression of a gene is disrupted, also known as a knockout). Such 
cells can serve as a model for studying disorders which are related to mutated or mis-expressed alleles or 
for use in drug screening. Also provided are human cells (e.g. , a hematopoietic stem cells) transfected 
with a TM7SF3 nucleic acid. 

[0055] Also provided are cells or a purified preparation thereof (e.g. , human cells) in which an 
endogenous TM7SF3 nucleic acid is under the control of a regulatory sequence that does not normally 
control the expression of the endogenous gene. The expression characteristics of an endogenous gene 
within a cell (e.g. , a cell line or microorganism) can be modified by inserting a heterologous DNA 
regulatory element into the genome of the cell such that the inserted regulatory element is operably 
linked to the corresponding endogenous gene. For example, an endogenous corresponding gene (e.g., a 
gene which is "transcriptionally silent," not normally expressed, or expressed only at very low levels) 
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may be activated by inserting a regulatory element which is capable of promoting the expression of a 
normally expressed gene product in that cell. Techniques such as targeted homologous recombinations, 
can be used to insert the heterologous DNA as described in, e.g., Chappel, US 5,272,071; WO 91/06667, 
published on May 16, 1991. 

Transgenic Animals 

[0056] Non-human transgenic animals that express a heterologous target polypeptide (e.g. , 
expressed from a TM7SF3 nucleic acid or substantially identical sequence thereof) can be generated. 
Such animals are useful for studying the function and/or activity of a target polypeptide and for 
identifying and/or evaluating modulators of the activity of TM7SF3 nucleic acids and encoded 
polypeptides. As used herein, a "transgenic animal" is a non-human animal such as a mammal (e.g., a 
non-human primate such as chimpanzee, baboon, or macaque; an ungulate such as an equine, bovine, or 
caprine; or a rodent such as a rat, a mouse, or an Israeli sand rat), a bird (e.g. , a chicken or a turkey), an 
amphibian (e.g., a frog, salamander, or newt), or an insect (e.g., Drosophila melanogaster), in which one 
or more of the cells of the animal includes a transgene. A transgene is exogenous DNA or a 
rearrangement (e.g. , a deletion of endogenous chromosomal DNA) that is often integrated into or occurs 
in the genome of cells in a transgenic animal. A transgene can direct expression of an encoded gene 
product in one or more cell types or tissues of the transgenic animal, and other transgenes can reduce 
expression (e.g. , a knockout). Thus, a transgenic animal can be one in which an endogenous nucleic acid 
homologous to a TM7SF3 nucleic acid has been altered by homologous recombination between the 
endogenous gene and an exogenous DNA molecule introduced into a cell of the animal (e.g., an 
embryonic cell of the animal) prior to development of the animal. 

[0057] Intronic sequences and polyadenylation signals can also be included in the transgene to 
increase expression efficiency of the transgene. One or more tissue-specific regulatory sequences can be 
operably linked to a TM7SF3 nucleotide sequence to direct expression of an encoded polypeptide to 
particular cells. A transgenic founder animal can be identified based upon the presence of a TM7SF3 
nucleotide sequence in its genome and/or expression of encoded mRNA in tissues or cells of the animals. 
A transgenic founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a TM7SF3 nucleotide sequence can further be bred to other 
transgenic animals carrying other transgenes. 

[0058] Target polypeptides can be expressed in transgenic animals or plants by introducing, for 
example, a TM7SF3 nucleic acid into the genome of an animal that encodes the target polypeptide. In 
preferred embodiments the nucleic acid is placed under the control of a tissue specific promoter, e.g. , a 
milk or egg specific promoter, and recovered from the milk or eggs produced by the animal. Also 
included is a population of cells from a transgenic animal. 
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Target Polypeptides 

[0059J Also featured herein are isolated target polypeptides, which are encoded by a TM7SF3 
nucleotide sequence (e.g., SEQ ID NO: 1-2), or a substantially identical nucleotide sequence thereof. 
Examples of TM7SF3 polypeptides are set forth in SEQ ID NO: 3. The term "polypeptide" as used 
herein includes proteins and peptides. An "isolated" or "purified" polypeptide or protein is substantially 
free of cellular material or other contaminating proteins from the cell or tissue source from which the 
protein is derived, or substantially free from chemical precursors or other chemicals when chemically 
synthesized. In one embodiment, the language "substantially free" means preparation of a target 
polypeptide having less than about 30%, 20%, 1 0% and more preferably 5% (by dry weight), of non- 
target polypeptide (also referred to herein as a "contaminating protein"), or of chemical precursors or 
non-target chemicals. When the target polypeptide or a biologically active portion thereof is 
recombinantly produced, it is also preferably substantially free of culture medium, specifically, where 
culture medium represents less than about 20%, sometimes less than about 10%, and often less than 
about 5% of the volume of the polypeptide preparation. Isolated or purified target polypeptide 
preparations are sometimes 0.01 milligrams or more or 0.1 milligrams or more, and often 1 .0 milligrams 
or more and 10 milligrams or more in dry weight. 

[0060] Further included herein are target polypeptide fragments. The polypeptide fragment may be 
a domain or part of a domain of a target polypeptide. The polypeptide fragment may have increased, 
decreased or unexpected biological activity. The polypeptide fragment is often 50 or fewer, 100 or fewer, 
or 200 or fewer amino acids in length, and is sometimes 300, 400, 500, 600, 700, or 900 or fewer amino 
acids in length. 

[0061] Substantially identical target polypeptides may depart from the amino acid sequences of 
target polypeptides in different manners. For example, conservative amino acid modifications may be 
introduced at one or more positions in the amino acid sequences of target polypeptides. A "conservative 
amino acid substitution" is one in which the amino acid is replaced by another amino acid having a 
similar structure and/or chemical function. Families of amino acid residues having similar structures and 
functions are well known. These families include amino acids with basic side chains (e.g. , lysine, 
arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Also, essential and non-essential amino acids may be replaced. A "non-essential" 
amino acid is one that can be altered without abolishing or substantially altering the biological function 
of a target polypeptide, whereas altering an "essential" amino acid abolishes or substantially alters the 
biological function of a target polypeptide. Amino acids that are conserved among target polypeptides 
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are typically essential amino acids. In certain embodiments, the polypeptide includes one or more non- 
synonymous polymorphic variants associated with osteoarthritis. 

[0062] Also, target polypeptides may exist as chimeric or fusion polypeptides. As used herein, a 
target "chimeric polypeptide" or target "fusion polypeptide" includes a target polypeptide linked to a 
non-target polypeptide. A "non-target polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a polypeptide which is not substantially identical to the target polypeptide, 
which includes, for example, a polypeptide that is different from the target polypeptide and derived from 
the same or a different organism. The target polypeptide in the fusion polypeptide can correspond to an 
entire or nearly entire target polypeptide or a fragment thereof. The non-target polypeptide can be fused 
to the N-terminus or C-terminus of the target polypeptide. 

[0063] Fusion polypeptides can include a moiety having high affinity for a ligand. For example, the 
fusion polypeptide can be a GST-target fusion polypeptide in which the target sequences are fused to the 
C-terminus of the GST sequences, or a polyhistidine-target fusion polypeptide in which the target 
polypeptide is fused at the N- or C-terminus to a string of histidine residues. Such fusion polypeptides 
can facilitate purification of recombinant target polypeptide. Expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide), and a nucleotide sequence in 
SEQ ID NO: 1-2, or a substantially identical nucleotide sequence thereof, can be cloned into an 
expression vector such that the fusion moiety is linked in-frame to the target polypeptide. Further, the 
fusion polypeptide can be a target polypeptide containing a heterologous signal sequence at its N- 
terminus. In certain host cells (e.g., mammalian host cells), expression, secretion, cellular internalization, 
and cellular localization of a target polypeptide can be increased through use of a heterologous signal 
sequence. Fusion polypeptides can also include all or a part of a serum polypeptide (e.g. , an IgG constant 
region or human serum albumin). 

[0064] Target polypeptides can be incorporated into pharmaceutical compositions and administered 
to a subject in vivo. Administration of these target polypeptides can be used to affect the bioavailability 
of a substrate of the target polypeptide and may effectively increase target polypeptide biological activity 
in a cell. Target fusion polypeptides may be useful therapeutically for the treatment of disorders caused 
by, for example, (i) aberrant modification or mutation of a gene encoding a target polypeptide; (ii) mis- 
regulation of the gene encoding the target polypeptide; and (iii) aberrant post-translational modification 
of a target polypeptide. Also, target polypeptides can be used as immunogens to produce anti-target 
antibodies in a subject, to purify target polypeptide ligands or binding partners, and in screening assays to 
identify molecules which inhibit or enhance the interaction of a target polypeptide with a substrate. 

[0065] In addition, polypeptides can be chemically synthesized using techniques known in the art 
(See, e.g., Creighton, 1983 Proteins. New York, N.Y.: W. H. Freeman and Company; and Hunkapiller et 
al., (1984) Nature July 12 -18;310(5973):105-1 1). For example, a relative short fragment can be 
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synthesized by use of a peptide synthesizer. Furthermore, if desired, non-classical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into the fragment sequence. 
Non-classical amino acids include, but are not limited to, to the D-isomers of the common amino acids, 
2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g- 
Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, 
norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids such 
as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid analogs in 
general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

[0066] Polypeptides and polypeptide fragments sometimes are differentially modified during or after 
translation, e.g., by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, 
etc. Any of numerous chemical modifications may be carried out by known techniques, including but not 
limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; and the like. Additional post-translational modifications include, for example, N-linked or 
O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical 
moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate 
chains, and addition or deletion of an N-terminal methionine residue as a result of prokaryotic host cell 
expression. The polypeptide fragments may also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the polypeptide. 

[0067] Also provided are chemically modified derivatives of polypeptides that can provide 
additional advantages such as increased solubility, stability and circulating time of the polypeptide, or 
decreased immunogenicity {see e.g., U.S. Pat. No: 4,179,337. The chemical moieties for derivitization 
may be selected from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene 
glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides 
may be modified at random positions within the molecule, or at predetermined positions within the 
molecule and may include one, two, three or more attached chemical moieties. 

[0068] The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 100 kDa (the term 
"about" indicating that in preparations of polyethylene glycol, some molecules will weigh more, some 
less, than the stated molecular weight) for ease in handling and manufacturing. Other sizes may be used, 
depending on the desired therapeutic profile (e.g., the duration of sustained release desired, the effects, if 
any on biological activity, the ease in handling, the degree or lack of antigenicity and other known effects 
of the polyethylene glycol to a therapeutic protein or analog). 
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[0069] The polymers should be attached to the polypeptide with consideration of effects on 
functional or antigenic domains of the polypeptide. There are a number of attachment methods available 
to those skilled in the art (e.g., EP 0 401 384 (coupling PEG to G-CSF) and Malik et al. (1992) Exp 
Hematol. September;20(8): 1028-35 (pegylation of GM-CSF using tresyl chloride)). For example, 
polyethylene glycol may be covalently bound through amino acid residues via a reactive group, such as a 
free amino or carboxyl group. Reactive groups are those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues and the C-terminal amino acid residue. Sulfhydryl groups may also be 
used as a reactive group for attaching the polyethylene glycol molecules. For therapeutic purposes, the 
attachment sometimes is at an amino group, such as attachment at the N-terminus or lysine group. 

[0070] Proteins can be chemically modified at the N-terminus. Using polyethylene glycol as an 
illustration of such a composition, one may select from a variety of polyethylene glycol molecules (by 
molecular weight, branching, and the like), the proportion of polyethylene glycol molecules to protein 
(polypeptide) molecules in the reaction mix, the type of pegylation reaction to be performed, and the 
method of obtaining the selected N-terminally pegylated protein. The method of obtaining the N- 
terminally pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
necessary) may be by purification of the N-terminally pegylated material from a population of pegylated 
protein molecules. Selective proteins chemically modified at the N-terminus may be accomplished by 
reductive alkylation, which exploits differential reactivity of different types of primary amino groups 
(lysine versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a carbonyl 
group containing polymer is achieved. 

Substantially Identical Nucleic Acids and Polypeptides 

[0071] Nucleotide sequences and polypeptide sequences that are substantially identical to a TM7SF3 
nucleotide sequence and the target polypeptide sequences encoded by those nucleotide sequences, 
respectively, are included herein. The term "substantially identical" as used herein refers to two or more 
nucleic acids or polypeptides sharing one or more identical nucleotide sequences or polypeptide 
sequences, respectively. Included are nucleotide sequences or polypeptide sequences that are 55% or 
more, 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 
95% or more (each often within a 1%, 2%, 3% or 4% variability) identical to a TM7SF3 nucleotide 
sequence or the encoded target polypeptide amino acid sequences. One test for determining whether two 
nucleic acids are substantially identical is to determine the percent of identical nucleotide sequences or 
polypeptide sequences shared between the nucleic acids or polypeptides. 
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[0072] Calculations of sequence identity are often performed as follows. Sequences are aligned for 
optimal comparison purposes (e.g. , gaps can be introduced in one or both of a first and a second amino 
acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). The length of a reference sequence aligned for comparison purposes is 
sometimes 30% or more, 40% or more, 50% or more, often 60% or more, and more often 70% or more, 
80% or more, 90% or more, or 100% of the length of the reference sequence. The nucleotides or amino 
acids at corresponding nucleotide or polypeptide positions, respectively, are then compared among the 
two sequences. When a position in the first sequence is occupied by the same nucleotide or amino acid 
as the corresponding position in the second sequence, the nucleotides or amino acids are deemed to be 
identical at that position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into account the number of gaps, and the length of 
each gap, introduced for optimal alignment of the two sequences. 

[0073] Comparison of sequences and determination of percent identity between two sequences can 
be accomplished using a mathematical algorithm. Percent identity between two amino acid or nucleotide 
sequences can be determined using the algorithm of Meyers & Miller, CABIOS 4: 1 1-17 (1989), which 
has been incorporated into the ALIGN program (version 2.0), using a PAM1 20 weight residue table, a 
gap length penalty of 12 and a gap penalty of 4. Also, percent identity between two amino acid 
sequences can be determined using the Needleman & Wunsch, J. Mol. Biol. 48: 444-453 (1970) 
algorithm which has been incorporated into the GAP program in the GCG software package (available at 
the http address www.gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. Percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software package (available 
at http address www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 
80 and a length weight of 1, 2, 3, 4, 5, or 6. A set of parameters often used is a Blossum 62 scoring 
matrix with a gap open penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 

[0074] Another manner for determining if two nucleic acids are substantially identical is to assess 
whether a polynucleotide homologous to one nucleic acid will hybridize to the other nucleic acid under 
stringent conditions. As use herein, the term "stringent conditions" refers to conditions for hybridization 
and washing. Stringent conditions are known to those skilled in the art and can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y. , 6.3.1-6.3.6 (1989). Aqueous and non- 
aqueous methods are described in that reference and either can be used. An example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 50°C. Another example of stringent 
hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
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followed by one or more washes in 0.2X SSC, 0.1% SDS at 55°C. A further example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 60°C. Often, stringent hybridization 
conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one 
or more washes in 0.2X SSC, 0.1% SDS at 65°C. More often, stringency conditions are 0.5M sodium 
phosphate, 7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. 

[0075] An example of a substantially identical nucleotide sequence to a nucleotide sequence in SEQ 
ID NO: 1-2 is one that has a different nucleotide sequence but still encodes the same polypeptide 
sequence encoded by the nucleotide sequence in SEQ ID NO: 1 -2. Another example is a nucleotide 
sequence that encodes a polypeptide having a polypeptide sequence that is more than 70% or more 
identical to, sometimes more than 75% or more, 80% or more, or 85% or more identical to, and often 
more than 90% or more and 95% or more identical to a polypeptide sequence encoded by a nucleotide 
sequence in SEQ ID NO: 1-2. 

[0076] Nucleotide sequences in SEQ ID NO: 1-2 and amino acid sequences of encoded polypeptides 
can be used as "query sequences" to perform a search against public databases to identify other family 
members or related sequences, for example. Such searches can be performed using the NBLAST and 
XBLAST programs (version 2.0) of Altschul et al, J. Mol. Biol. 215: 403-10 (1990). BLAST nucleotide 
searches can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to nucleotide sequences in SEQ ED NO: 1-2. BLAST polypeptide 
searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to polypeptides encoded by the nucleotide sequences of SEQ ED NO: 1-2. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul etal, Nucleic Acids Res. 25(17): 3389-3402 (1997). When utilizing BLAST and Gapped 
BLAST programs, default parameters of the respective programs {e.g., XBLAST and NBLAST) can be 
used {see the http address www.ncbi.nlm.nih.gov). 

[0077] A nucleic acid that is substantially identical to a nucleotide sequence in SEQ ED NO: 1-2 
may include polymorphic sites at positions equivalent to those described herein when the sequences are 
aligned. For example, using the alignment procedures described herein, SNPs in a sequence substantially 
identical to a sequence in SEQ ED NO: 1-2 can be identified at nucleotide positions that match {i.e., align) 
with nucleotides at SNP positions in each nucleotide sequence in SEQ ED NO: 1-2. Also, where a 
polymorphic variation results in an insertion or deletion, insertion or deletion of a nucleotide sequence 
from a reference sequence can change the relative positions of other polymorphic sites in the nucleotide 
sequence. 
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[0078] Substantially identical nucleotide and polypeptide sequences include those that are naturally 
occurring, such as allelic variants (same locus), splice variants, homologs (different locus), and orthologs 
(different organism) or can be non-naturally occurring. Non-naturally occurring variants can be 
generated by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. 
The variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. The variations can produce both conservative 
and non-conservative amino acid substitutions (as compared in the encoded product). Orthologs, 
homologs, allelic variants, and splice variants can be identified using methods known in the art. These 
variants normally comprise a nucleotide sequence encoding a polypeptide that is 50% or more, about 
55% or more, often about 70-75% or more or about 80-85% or more, and sometimes about 90-95% or 
more identical to the amino acid sequences of target polypeptides or a fragment thereof. Such nucleic 
acid molecules can readily be identified as being able to hybridize under stringent conditions to a 
nucleotide sequence in SEQ ID NO: 1-2 or a fragment of this sequence. Nucleic acid molecules 
corresponding to orthologs, homologs, and allelic variants of a nucleotide sequence in SEQ ID NO: 1-2 
can further be identified by mapping the sequence to the same chromosome or locus as the nucleotide 
sequence in SEQ ID NO: 1-2. 

[0079] Also, substantially identical nucleotide sequences may include codons that are altered with 
respect to the naturally occurring sequence for enhancing expression of a target polypeptide in a 
particular expression system. For example, the nucleic acid can be one in which one or more codons are 
altered, and often 10% or more or 20% or more of the codons are altered for optimized expression in 
bacteria (e.g., E. coli.), yeast (e.g., S. cervesiae), human (e.g., 293 cells), insect, or rodent (e.g., hamster) 
cells. 

Methods for Identifying Risk of Osteoarthritis 

[0080] Methods for prognosing and diagnosing osteoarthritis are included herein. These methods 
include detecting the presence or absence of one or more polymorphic variations in a nucleotide sequence 
associated with osteoarthritis, such as variants in or around the loci set forth herein, or a substantially 
identical sequence thereof, in a sample from a subject, where the presence of a polymorphic variant 
described herein is indicative of a risk of osteoarthritis. Determining a risk of osteoarthritis sometimes 
refers to determining whether an individual is at an increased risk of osteoarthritis (e.g., intermediate risk 
or higher risk). 

[0081] Thus, featured herein is a method for identifying a subject who is at risk of osteoarthritis, 
which comprises detecting an aberration associated with osteoarthritis in a nucleic acid sample from the 
subject. An embodiment is a method for detecting a risk of osteoarthritis in a subject, which comprises 
detecting the presence or absence of a polymorphic variation associated with osteoarthritis at a 
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polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject, where the nucleotide 
sequence comprises a polynucleotide sequence selected from the group consisting of: (a) a nucleotide 
sequence of SEQ ID NO: 1 -2; (b) a nucleotide sequence which encodes a polypeptide consisting of an 
amino acid sequence encoded by a nucleotide sequence of SEQ ED NO: 1-2; (c) a nucleotide sequence 
which encodes a polypeptide that is 90% or more identical to an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1 -2, or a nucleotide sequence about 90% or more identical to a 
nucleotide sequence of SEQ ID NO: 1 -2; and (d) a fragment of a nucleotide sequence of (a), (b), or (c) 
comprising the polymorphic site; whereby the presence of the polymorphic variation is indicative of a 
predisposition to osteoarthritis in the subject. In certain embodiments, polymorphic variants at the 
positions described herein are detected for determining a risk of osteoarthritis, and polymorphic variants 
at positions in linkage disequilibrium with these positions are detected for determining a risk of 
osteoarthritis. As used herein, "SEQ ID NO: 1-2" refers to individual sequences in SEQ ED NO: 1 or 2, 
each sequence being separately applicable to embodiments described herein. 

[0082] Risk of osteoarthritis sometimes is expressed as a probability, such as an odds ratio, 
percentage, or risk factor. Risk often is based upon the presence or absence of one or more polymorphic 
variants described herein, and also may be based in part upon phenotypic traits of the individual being 
tested. Methods for calculating risk based upon patient data are well known {see, e.g. , Agresti, 
Categorical Data Analysis, 2nd Ed. 2002. Wiley). Allelotyping and genotyping analyses may be carried 
out in populations other than those exemplified herein to enhance the predictive power of the prognostic 
method. These further analyses are executed in view of the exemplified procedures described herein, and 
may be based upon the same polymorphic variations or additional polymorphic variations. 

[0083] In certain embodiments, determining the presence of a combination of two or more 
polymorphic variants associated with osteoarthritis in one or more genetic loci (e.g., one or more genes) 
of the sample is determined to identify, quantify and/or estimate, risk of osteoarthritis. The risk often is 
the probability of having or developing osteoarthritis. The risk sometimes is expressed as a relative risk 
with respect to a population average risk of osteoarthritis, and sometimes is expressed as a relative risk 
with respect to the lowest risk group. Such relative risk assessments often are based upon penetrance 
values determined by statistical methods, and are particularly useful to clinicians and insurance 
companies for assessing risk of osteoarthritis (e.g., a clinician can target appropriate detection, prevention 
and therapeutic regimens to a patient after determining the patient's risk of osteoarthritis, and an 
insurance company can fine tune actuarial tables based upon population genotype assessments of 
osteoarthritis risk). Risk of osteoarthritis sometimes is expressed as an odds ratio, which is the odds of a 
particular person having a genotype has or will develop osteoarthritis with respect to another genotype 
group (e.g., the most disease protective genotype or population average). In related embodiments, the 
determination is utilized to identify a subject at risk of osteoarthritis. In an embodiment, two or more 
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polymorphic variations are detected in two or more regions in human genomic DNA associated with 
increased risk of osteoarthritis, such as a locus containing a TM7SF3, for example. In certain 
embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more 
polymorphic variants are detected in the sample. In specific embodiments, polymorphic variants are 
detected in a TM7SF3 region, for example. In certain embodiments, polymorphic variants are detected at 
other genetic loci (e.g., the polymorphic variants can be detected in TM7SF3 in addition to other loci or 
only in other loci), where the other loci include but are not limited to those described in concurrently- 
filed patent applications having attorney docket number 524593008700, 524593008800, 524593008900, 
524593009000 or 524593009200, which is incorporated herein by reference in its entirety. 

[0084] Results from prognostic tests may be combined with other test results to diagnose 
osteoarthritis. For example, prognostic results may be gathered, a patient sample may be ordered based 
on a determined predisposition to osteoarthritis, the patient sample is analyzed, and the results of the 
analysis may be utilized to diagnose osteoarthritis. Also osteoarthritis diagnostic method can be 
developed from studies used to generate prognostic methods in which populations are stratified into 
subpopulations having different progressions of osteoarthritis. In another embodiment, prognostic results 
may be gathered, a patient's risk factors for developing osteoarthritis {e.g., age, weight, race, diet) 
analyzed, and a patient sample may be ordered based on a determined predisposition to osteoarthritis. 

[0085] The nucleic acid sample typically is isolated from a biological sample obtained from a 
subject. For example, nucleic acid can be isolated from blood, saliva, sputum, urine, cell scrapings, and 
biopsy tissue. The nucleic acid sample can be isolated from a biological sample using standard 
techniques, such as the technique described in Example 2. As used herein, the term "subject" refers 
primarily to humans but also refers to other mammals such as dogs, cats, and ungulates {e.g., cattle, 
sheep, and swine). Subjects also include avians {e.g., chickens and turkeys), reptiles, and fish {e.g., 
salmon), as embodiments described herein can be adapted to nucleic acid samples isolated from any of 
these organisms. The nucleic acid sample may be isolated from the subject and then directly utilized in a 
method for determining the presence of a polymorphic variant, or alternatively, the sample may be 
isolated and then stored {e.g., frozen) for a period of time before being subjected to analysis. 

[0086] The presence or absence of a polymorphic variant is determined using one or both 
chromosomal complements represented in the nucleic acid sample. Determining the presence or absence 
of a polymorphic variant in both chromosomal complements represented in a nucleic acid sample from a 
subject having a copy of each chromosome is useful for determining the zygosity of an individual for the 
polymorphic variant {i.e., whether the individual is homozygous or heterozygous for the polymorphic 
variant). Any oligonucleotide-based diagnostic may be utilized to determine whether a sample includes 
the presence or absence of a polymorphic variant in a sample. For example, primer extension methods, 
ligase sequence determination methods {e.g., U.S. Pat. Nos. 5,679,524 and 5,952,174, and WO 
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01/27326), mismatch sequence determination methods (e.g., U.S. Pat. Nos. 5,851,770; 5,958,692; 
6,110,684; and 6,183,958), microarray sequence determination methods, restriction fragment length 
polymorphism (RFLP), single strand conformation polymorphism detection (SSCP) (e.g., U.S. Pat. Nos. 
5,891,625 and 6,013,499), PCR-based assays (e.g., TAQMAN® PCR System (Applied Biosystems)), and 
nucleotide sequencing methods may be used. 

[0087] Oligonucleotide extension methods typically involve providing a pair of oligonucleotide 
primers in a polymerase chain reaction (PCR) or in other nucleic acid amplification methods for the 
purpose of amplifying a region from the nucleic acid sample that comprises the polymorphic variation. 
One oligonucleotide primer is complementary to a region 3 ' of the polymorphism and the other is 
complementary to a region 5' of the polymorphism. A PCR primer pair may be used in methods 
disclosed in U.S. Pat. Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 5,998,143; 6,140,054; WO 
01/27327; and WO 01/27329 for example. PCR primer pairs may also be used in any commercially 
available machines that perform PCR, such as any of the GENEAMP® Systems available from Applied 
Biosystems. Also, those of ordinary skill in the art will be able to design oligonucleotide primers based 
upon a TM7SF3 nucleotide sequence using knowledge available in the art. 

[0088] Also provided is an extension oligonucleotide that hybridizes to the amplified fragment 
adjacent to the polymorphic variation. As used herein, the term "adjacent" refers to the 3' end of the 
extension oligonucleotide being often 1 nucleotide from the 5' end of the polymorphic site, and 
sometimes 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides from the 5' end of the polymorphic site, in the nucleic 
acid when the extension oligonucleotide is hybridized to the nucleic acid. The extension oligonucleotide 
then is extended by one or more nucleotides, and the number and/or type of nucleotides that are added to 
the extension oligonucleotide determine whether the polymorphic variant is present. Oligonucleotide 
extension methods are disclosed, for example, in U.S. Pat. Nos. 4,656,127; 4,851,331; 5,679,524; 
5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 6,013,431; 6,017,702; 
6,046,005; 6,087,095; 6,210,891; and WO 01/20039. Oligonucleotide extension methods using mass 
spectrometry are described, for example, in U.S. Pat. Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; and 6,194,144, and a method often utilized is described herein in 
Example 2. 

[0089] A microarray can be utilized for determining whether a polymorphic variant is present or 

absent in a nucleic acid sample. A microarray may include any oligonucleotides described herein, and 

methods for making and using oligonucleotide microarrays suitable for diagnostic use are disclosed in 

U.S. Pat. Nos. 5,492,806; 5,525,464; 5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 

6,142,681; 6,156,501; 6,197,506; 6,223,127; 6,225,625; 6,229,91 1; 6,239,273; WO 00/52625; WO 

01/25485; and WO 01/29259. The microarray typically comprises a solid support and the 

oligonucleotides may be linked to this solid support by covalent bonds or by non-covalent interactions. 
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The oligonucleotides may also be linked to the solid support directly or by a spacer molecule. A 
microarray may comprise one or more oligonucleotides complementary to a polymorphic site set forth 
herein. 

[0090] A kit also may be utilized for determining whether a polymorphic variant is present or absent 
in a nucleic acid sample. A kit often comprises one or more pairs of oligonucleotide primers useful for 
amplifying a fragment of a nucleotide sequence of SEQ ID NO: 1 -2 or a substantially identical sequence 
thereof, where the fragment includes a polymorphic site. The kit sometimes comprises a polymerizing 
agent, for example, a thermostable nucleic acid polymerase such as one disclosed in U.S. Pat. Nos. 
4,889,818 or 6,077,664. Also, the kit often comprises an elongation oligonucleotide that hybridizes to a 
TM7SF3 nucleotide sequence in a nucleic acid sample adjacent to the polymorphic site. Where the kit 
includes an elongation oligonucleotide, it also often comprises chain elongating nucleotides, such as 
dATP, dTTP, dGTP, dCTP, and dITP, including analogs of dATP, dTTP, dGTP, dCTP and dITP, 
provided that such analogs are substrates for a thermostable nucleic acid polymerase and can be 
incorporated into a nucleic acid chain elongated from the extension oligonucleotide. Along with chain 
elongating nucleotides would be one or more chain terminating nucleotides such as ddATP, ddTTP, 
ddGTP, ddCTP, and the like. In an embodiment, the kit comprises one or more oligonucleotide primer 
pairs, a polymerizing agent, chain elongating nucleotides, at least one elongation oligonucleotide, and 
one or more chain terminating nucleotides. Kits optionally include buffers, vials, microtiter plates, and 
instructions for use. 

[0091] An individual identified as being at risk of osteoarthritis may be heterozygous or 
homozygous with respect to the allele associated with a higher risk of osteoarthritis. A subject 
homozygous for an allele associated with an increased risk of osteoarthritis is at a comparatively high 
risk of osteoarthritis, a subject heterozygous for an allele associated with an increased risk of 
osteoarthritis is at a comparatively intermediate risk of osteoarthritis, and a subject homozygous for an 
allele associated with a decreased risk of osteoarthritis is at a comparatively low risk of osteoarthritis. A 
genotype may be assessed for a complementary strand, such that the complementary nucleotide at a 
particular position is detected. 

[0092] Also featured are methods for determining risk of osteoarthritis and/or identifying a subject 
at risk of osteoarthritis by contacting a polypeptide or protein encoded by a TM7SF3 nucleotide sequence 
from a subject with an antibody that specifically binds to an epitope associated with increased risk of 
osteoarthritis in the polypeptide. 

Applications of Prognostic and Diagnostic Results to Pharmaco genomic Methods 
[0093] Pharmacogenomics is a discipline that involves tailoring a treatment for a subject according 
to the subject's genotype as a particular treatment regimen may exert a differential effect depending upon 
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the subject's genotype. For example, based upon the outcome of a prognostic test described herein, a 
clinician or physician may target pertinent information and preventative or therapeutic treatments to a 
subject who would be benefited by the information or treatment and avoid directing such information and 
treatments to a subject who would not be benefited (e.g., the treatment has no therapeutic effect and/or 
the subject experiences adverse side effects). 

[0094] The following is an example of a pharmacogenomic embodiment. A particular treatment 
regimen can exert a differential effect depending upon the subject's genotype. Where a candidate 
therapeutic exhibits a significant interaction with a major allele and a comparatively weak interaction 
with a minor allele (e.g. , an order of magnitude or greater difference in the interaction), such a 
therapeutic typically would not be administered to a subject genotyped as being homozygous for the 
minor allele, and sometimes not administered to a subject genotyped as being heterozygous for the minor 
allele. In another example, where a candidate therapeutic is not significantly toxic when administered to 
subjects who are homozygous for a major allele but is comparatively toxic when administered to subjects 
heterozygous or homozygous for a minor allele, the candidate therapeutic is not typically administered to 
subjects who are genotyped as being heterozygous or homozygous with respect to the minor allele. 

[0095] The methods described herein are applicable to pharmacogenomic methods for preventing, 
alleviating or treating osteoarthritis. For example, a nucleic acid sample from an individual may be 
subjected to a prognostic test described herein. Where one or more polymorphic variations associated 
with increased risk of osteoarthritis are identified in a subject, information for preventing or treating 
osteoarthritis and/or one or more osteoarthritis treatment regimens then may be prescribed to that subject. 

[0096] In certain embodiments, a treatment or preventative regimen is specifically prescribed and/or 
administered to individuals who will most benefit from it based upon their risk of developing 
osteoarthritis assessed by the methods described herein. Thus, provided are methods for identifying a 
subject predisposed to osteoarthritis and then prescribing a therapeutic or preventative regimen to 
individuals identified as having a predisposition. Thus, certain embodiments are directed to a method for 
reducing osteoarthritis in a subject, which comprises: detecting the presence or absence of a polymorphic 
variant associated with osteoarthritis in a nucleotide sequence in a nucleic acid sample from a subject, 
where the nucleotide sequence comprises a polynucleotide sequence selected from the group consisting 
of: (a) a nucleotide sequence of SEQ ID NO: 1 -2; (b) a nucleotide sequence which encodes a polypeptide 
consisting of an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-2; (c) a 
nucleotide sequence which encodes a polypeptide that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-2, or a nucleotide sequence about 90% or 
more identical to a nucleotide sequence of SEQ ID NO: 1-2; and (d) a fragment of a polynucleotide 
sequence of (a), (b), or (c); and prescribing or administering a treatment regimen to a subject from whom 
the sample originated where the presence of a polymorphic variation associated with osteoarthritis is 
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detected in the nucleotide sequence. In these methods, predisposition results may be utilized in 
combination with other test results to diagnose osteoarthritis. 

[0097] Certain preventative treatments often are prescribed to subjects having a predisposition to 
osteoarthritis and where the subject is diagnosed with osteoarthritis or is diagnosed as having symptoms 
indicative of an early stage of osteoarthritis. The treatment sometimes is preventative (e.g., is prescribed 
or administered to reduce the probability that osteoarthritis arises or progresses), sometimes is 
therapeutic, and sometimes delays, alleviates or halts the progression of osteoarthritis. Any known 
preventative or therapeutic treatment for alleviating or preventing the occurrence of osteoarthritis is 
prescribed and/or administered. For example, the treatment often is directed to decreasing pain and 
improving joint movement. Examples of OA treatments include exercises to keep joints flexible and 
improve muscle strength. Different medications to control pain, including corticosteroids and 
nonsteroidal anti-inflammatory drugs (NSAIDs, e.g., Voltaren); cyclooxygenase-2 (COX-2) inhibitors 
(e.g., Celebrex, Vioxx, Mobic, and Bextra); monoclonal antibodies (e.g., Remicade); tumor necrosis 
factor inhibitors (e.g., Enbrel); or injections of glucocorticoids, hyaluronic acid or chondrotin sulfate into 
joints that are inflamed and not responsive to NSAIDS. Orally administered chondroitin sulfate also may 
be used as a therapeutic, as it may increase hyaluronic acid levels and viscosity of synovial fluid, and 
decrease collagenase levels in synovial fluid. Also, glucosamine can serve as an OA therapeutic as 
delivering it into joints may inhibit enzymes involved in cartilage degradation and enhance the 
production of hyaluronic acid. For mild pain without inflammation, acetaminophen may be used. Other 
treatments include: heat/cold therapy for temporary pain relief; joint protection to prevent strain or stress 
on painful joints; surgery to relieve chronic pain in damaged joints; and weight control to prevent extra 
stress on weight-bearing joints. 

[0098] As therapeutic approaches for treating osteoarthritis continue to evolve and improve, the goal 
of treatments for osteoarthritis related disorders is to intervene even before clinical signs first manifest. 
Thus, genetic markers associated with susceptibility to osteoarthritis prove useful for early diagnosis, 
prevention and treatment of osteoarthritis. 

[0099] As osteoarthritis preventative and treatment information can be specifically targeted to 
subjects in need thereof (e.g., those at risk of developing osteoarthritis or those in an early stage of 
osteoarthritis), provided herein is a method for preventing or reducing the risk of developing 
osteoarthritis in a subject, which comprises: (a) detecting the presence or absence of a polymorphic 
variation associated with osteoarthritis at a polymorphic site in a nucleotide sequence in a nucleic acid 
sample from a subject; (b) identifying a subject with a predisposition to osteoarthritis, whereby the 
presence of the polymorphic variation is indicative of a predisposition to osteoarthritis in the subject; and 
(c) if such a predisposition is identified, providing the subject with information about methods or 
products to prevent or reduce osteoarthritis or to delay the onset of osteoarthritis. Also provided is a 
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method of targeting information or advertising to a subpopulation of a human population based on the 
subpopulation being genetically predisposed to a disease or condition, which comprises: (a) detecting the 
presence or absence of a polymorphic variation associated with osteoarthritis at a polymorphic site in a 
nucleotide sequence in a nucleic acid sample from a subject; (b) identifying the subpopulation of subjects 
in which the polymorphic variation is associated with osteoarthritis; and (c) providing information only 
to the subpopulation of subjects about a particular product which may be obtained and consumed or 
applied by the subject to help prevent or delay onset of the disease or condition. 

[0100] Pharmacogenomics methods also may be used to analyze and predict a response to 
osteoarthritis treatment or a drug. For example, if pharmacogenomics analysis indicates a likelihood that 
an individual will respond positively to osteoarthritis treatment with a particular drug, the drug may be 
administered to the individual. Conversely, if the analysis indicates that an individual is likely to respond 
negatively to treatment with a particular drug, an alternative course of treatment may be prescribed. A 
negative response may be defined as either the absence of an efficacious response or the presence of toxic 
side effects. The response to a therapeutic treatment can be predicted in a background study in which 
subjects in any of the following populations are genotyped: a population that responds favorably to a 
treatment regimen, a population that does not respond significantly to a treatment regimen, and a 
population that responds adversely to a treatment regimen (e.g., exhibits one or more side effects)., These 
populations are provided as examples and other populations and subpopulations may be analyzed. Based 
upon the results of these analyses, a subject is genotyped to predict whether he or she will respond 
favorably to a treatment regimen, not respond significantly to a treatment regimen, or respond adversely 
to a treatment regimen. 

[0101] The tests described herein also are applicable to clinical drug trials. One or more 
polymorphic variants indicative of response to an agent for treating osteoarthritis or to side effects to an 
agent for treating osteoarthritis may be identified using the methods described herein. Thereafter, 
potential participants in clinical trials of such an agent may be screened to identify those individuals most 
likely to respond favorably to the drug and exclude those likely to experience side effects. In that way, 
the effectiveness of drug treatment may be measured in individuals who respond positively to the drug, 
without lowering the measurement as a result of the inclusion of individuals who are unlikely to respond 
positively in the study and without risking undesirable safety problems. 

[0102] Thus, another embodiment is a method of selecting an individual for inclusion in a clinical 
trial of a treatment or drug comprising the steps of: (a) obtaining a nucleic acid sample from an 
individual; (b) determining the identity of a polymorphic variation which is associated with a positive 
response to the treatment or the drug, or at least one polymorphic variation which is associated with a 
negative response to the treatment or the drug in the nucleic acid sample, and (c) including the individual 
in the clinical trial if the nucleic acid sample contains said polymorphic variation associated with a 
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positive response to the treatment or the drug or if the nucleic acid sample lacks said polymorphic 
variation associated with a negative response to the treatment or the drug. In addition, the methods 
described herein for selecting an individual for inclusion in a clinical trial of a treatment or drug 
encompass methods with any further limitation described in this disclosure, or those following, specified 
alone or in any combination. The polymorphic variation may be in a sequence selected individually or in 
any combination from the group consisting of (i) a nucleotide sequence of SEQ ED NO: 1-2; (ii) a 
nucleotide sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-2; (iii) a nucleotide sequence which encodes a polypeptide that is 
90% or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1 -2, 
or a nucleotide sequence about 90% or more identical to a nucleotide sequence of SEQ ID NO: 1-2; and 
(iv) a fragment of a polynucleotide sequence of (i), (ii), or (iii) comprising the polymorphic site. The 
including step (c) optionally comprises administering the drug or the treatment to the individual if the 
nucleic acid sample contains the polymorphic variation associated with a positive response to the 
treatment or the drug and the nucleic acid sample lacks said biallelic marker associated with a negative 
response to the treatment or the drug. 

[0103] Also provided herein is a method of partnering between a diagnostic/prognostic testing 
provider and a provider of a consumable product, which comprises: (a) the diagnostic/prognostic testing 
provider detects the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject; (b) the 
diagnostic/prognostic testing provider identifies the subpopulation of subjects in which the polymorphic 
variation is associated with osteoarthritis; (c) the diagnostic/prognostic testing provider forwards 
information to the subpopulation of subjects about a particular product which may be obtained and 
consumed or applied by the subject to help prevent or delay onset of the disease or condition; and (d) the 
provider of a consumable product forwards to the diagnostic test provider a fee every time the 
diagnostic/prognostic test provider forwards information to the subject as set forth in step (c) above. 

Compositions Comprising Osteoarthritis-Directed Molecules 

[0104] Featured herein is a composition comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and one or more molecules specifically directed and targeted to a nucleic acid 
comprising a TM7SF3 nucleotide sequence or amino acid sequence. Such directed molecules include, 
but are not limited to, a compound that binds to a TM7SF3 nucleotide sequence or amino acid sequence 
referenced herein; a RNAi or siRNA molecule having a strand complementary or substantially 
complementary to a TM7SF3 nucleotide sequence (e.g., hybridizes to a TM7SF3 nucleotide sequence 
under conditions of high stringency); an antisense nucleic acid complementary or substantially 
complementary to an RNA encoded by a TM7SF3 nucleotide sequence (e.g., hybridizes to a TM7SF3 
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nucleotide sequence under conditions of high stringency); a ribozyme that hybridizes to a TM7SF3 
nucleotide sequence (e.g., hybridizes to a TM7SF3 nucleotide sequence under conditions of high 
stringency); a nucleic acid aptamer that specifically binds a polypeptide encoded by TM7SF3 nucleotide 
sequence; and an antibody that specifically binds to a polypeptide encoded by TM7SF3 nucleotide 
sequence or binds to a nucleic acid having such a nucleotide sequence. In specific embodiments, the 
osteoarthritis directed molecule interacts with a nucleic acid or polypeptide variant associated with 
osteoarthritis, such as variants referenced herein. In other embodiments, the osteoarthritis directed 
molecule interacts with a polypeptide involved in a signal pathway of a polypeptide encoded by a 
TM7SF3 nucleotide sequence, or a nucleic acid comprising such a nucleotide sequence. 

[0105] Compositions sometimes include an adjuvant known to stimulate an immune response, and 
in certain embodiments, an adjuvant that stimulates a T-cell lymphocyte response. Adjuvants are known, 
including but not limited to an aluminum adjuvant (e.g., aluminum hydroxide); a cytokine adjuvant or 
adjuvant that stimulates a cytokine response (e.g., interleukin (IL)-12 and/or gamma-interferon 
cytokines); a Freund-type mineral oil adjuvant emulsion (e.g., Freund's complete or incomplete 
adjuvant); a synthetic lipoid compound; a copolymer adjuvant (e.g., TitreMax); a saponin; Quil A; a 
liposome; an oil-in-water emulsion (e.g., an emulsion stabilized by Tween 80 and pluronic 
polyoxyethlene/polyoxypropylene block copolymer (Syntex Adjuvant Formulation); TitreMax; 
detoxified endotoxin (MPL) and mycobacterial cell wall components (TDW, CWS) in 2% squalene (Ribi 
Adjuvant System)); a muramyl dipeptide; an immune-stimulating complex (ISCOM, e.g., an Ag- 
modified saponin/cholesterol micelle that forms stable cage-like structure); an aqueous phase adjuvant 
that does not have a depot effect (e.g., Gerbu adjuvant); a carbohydrate polymer (e.g., AdjuPrime); L- 
tyrosine; a manide-oleate compound (e.g., Montanide); an ethylene-vinyl acetate copolymer (e.g., Elvax 
40W1,2); or lipid A, for example. Such compositions are useful for generating an immune response 
against osteoarthritis directed molecule (e.g., an HLA-binding subsequence within a polypeptide encoded 
by a TM7SF3 nucleotide sequence). In such methods, a peptide having an amino acid subsequence of a 
polypeptide encoded by a TM7SF3 nucleotide sequence is delivered to a subject, where the subsequence 
binds to an HLA molecule and induces a CTL lymphocyte response. The peptide sometimes is delivered 
to the subject as an isolated peptide or as a minigene in a plasmid that encodes the peptide. Methods for 
identifying HLA-binding subsequences in such polypeptides are known (see e.g., publication 
WO02/20616 and PCT application US98/01373 for methods of identifying such sequences). 

[0106] The cell may be in a group of cells cultured in vitro or in a tissue maintained in vitro or 
present in an animal in vivo (e.g., a rat, mouse, ape or human). In certain embodiments, a composition 
comprises a component from a cell such as a nucleic acid molecule (e.g., genomic DNA), a protein 
mixture or isolated protein, for example. The aforementioned compositions have utility in diagnostic, 
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prognostic and pharmacogenomic methods described previously and in therapeutics described hereafter. 
Certain osteoarthritis directed molecules are described in greater detail below. 

Compounds 

[0107] Compounds can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; peptoid libraries (libraries of molecules having 
the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic 
degradation but which nevertheless remain bioactive (see, e.g., Zuckermann et al., J. Med. Chem.37: 
2678-85 (1994)); spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Biological library and peptoid library approaches are 
typically limited to peptide libraries, while the other approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, Anticancer Drug Des. 12: 145, (1997)). 
Examples of methods for synthesizing molecular libraries are described, for example, in DeWitt et al., 
Proc. Natl. Acad. Sci. U.S.A. 90: 6909 (1993); Erb et al., Proc. Natl. Acad. Sci. USA 91 : 1 1422 (1994); 
Zuckermann et al., J. Med. Chem. 37: 2678 (1994); Cho et al., Science 261 : 1303 (1993); Carrell et al., 
Angew. Chem. Int. Ed. Engl. 33: 2059 (1994); Carell et al., Angew. Chem. Int. Ed. Engl. 33: 2061 
(1994); and in Gallop et al., J. Med. Chem. 37: 1233 (1994). 

[0108] Libraries of compounds may be presented in solution (e.g., Houghten, Biotechniques 13: 
412-421 (1992)), or on beads (Lam, Nature 354: 82-84 (1991)), chips (Fodor, Nature 364: 555-556 
(1993)), bacteria or spores (Ladner, United States Patent No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. USA 89: 1865-1869 (1992)) or on phage (Scott and Smith, Science 249: 386-390 (1990); 
Devlin, Science 249: 404-406 (1990); Cwirla et al., Proc. Natl. Acad. Sci. 87: 6378-6382 (1990); Felici, 
J. Mol. Biol. 222: 301-310 (1991); Ladner supra.). 

[0109] A compound sometimes alters expression and sometimes alters activity of a polypeptide 
target and may be a small molecule. Small molecules include, but are not limited to, peptides, 
peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or inorganic compounds (i.e., including heteroorganic 
and organometallic compounds) having a molecular weight less than about 10,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 5,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1 ,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 
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Antisense Nucleic Acid Molecules. Ribozymes. RNAi. siRNA and Modified Nucleic 
Acid Molecules 

[0110] An "antisense" nucleic acid refers to a nucleotide sequence complementary to a "sense" 
nucleic acid encoding a polypeptide, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. The antisense nucleic acid can be 
complementary to an entire coding strand, or to a portion thereof or a substantially identical sequence 
thereof. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence (e.g., 5' and 3' untranslated regions in SEQ ID NO: 1). 

[0111] An antisense nucleic acid can be designed such that it is complementary to the entire coding 
region of an mRNA encoded by a nucleotide sequence (e.g., SEQ ID NO: 1 ), and often the antisense 
nucleic acid is an oligonucleotide antisense to only a portion of a coding or noncoding region of the 
mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the 
translation start site of the mRNA, e.g., between the -10 and +10 regions of the target gene nucleotide 
sequence of interest. An antisense oligonucleotide can be, for example, about 7, 10, 15, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, or more nucleotides in length. The antisense nucleic acids, which 
include the ribozymes described hereafter, can be designed to target a TM7SF3 nucleotide sequence, 
often a variant associated with osteoarthritis, or a substantially identical sequence thereof. Among the 
variants, minor alleles and major alleles can be targeted, and those associated with a higher risk of 
osteoarthritis are often designed, tested, and administered to subjects. 

[0112] An antisense nucleic acid can be constructed using chemical synthesis and enzymatic ligation 
reactions using standard procedures. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used. Antisense nucleic acid 
also can be produced biologically using an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following subsection). 

[0113] When utilized as therapeutics, antisense nucleic acids typically are administered to a subject 
(e.g., by direct injection at a tissue site) or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a polypeptide and thereby inhibit expression of the 
polypeptide, for example, by inhibiting transcription and/or translation. Alternatively, antisense nucleic 
acid molecules can be modified to target selected cells and then are administered systemically. For 
systemic administration, antisense molecules can be modified such that they specifically bind to receptors 
or antigens expressed on a selected cell surface, for example, by linking antisense nucleic acid molecules 
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to peptides or antibodies which bind to cell surface receptors or antigens. Antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. Sufficient intracellular 
concentrations of antisense molecules are achieved by incorporating a strong promoter, such as a pol II or 
pol III promoter, in the vector construct. 

[0114] Antisense nucleic acid molecules sometimes are alpha-anomeric nucleic acid molecules. An 
alpha-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA 
in which, contrary to the usual beta-units, the strands run parallel to each other (Gaultier et al., Nucleic 
Acids. Res. 15: 6625-6641 (1987)). Antisense nucleic acid molecules can also comprise a 2'-o- 
methylribonucleotide (Inoue et al., Nucleic Acids Res. 15: 6131-6148 (1987)) or a chimeric RNA-DNA 
analogue (Inoue et al., FEBS Lett. 215: 327-330 (1987)). Antisense nucleic acids sometimes are 
composed of DNA or PNA or any other nucleic acid derivatives described previously. 

[0115] In another embodiment, an antisense nucleic acid is a ribozyme. A ribozyme having 
specificity for a TM7SF3 nucleotide sequence can include one or more sequences complementary to such 
a nucleotide sequence, and a sequence having a known catalytic region responsible for mRNA cleavage 
(see e.g., U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach, Nature 334: 585-591 (1988)). For example, 
a derivative of a Tetrahymena L-19 IVS RNA is sometimes utilized in which the nucleotide sequence of 
the active site is complementary to the nucleotide sequence to be cleaved in a mRNA (see e.g., Cech et 
al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,1 16,742). Also, target mRNA sequences 
can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA 
molecules (see e.g., Bartel & Szostak, Science 261: 1411-1418 (1993)). 

[0116] Osteoarthritis directed molecules include in certain embodiments nucleic acids that can form 
triple helix structures with a TM7SF3 nucleotide sequence, or a substantially identical sequence thereof, 
especially one that includes a regulatory region that controls expression of a polypeptide. Gene 
expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of 
a nucleotide sequence referenced herein or a substantially identical sequence (e.g., promoter and/or 
enhancers) to form triple helical structures that prevent transcription of a gene in target cells (see e.g., 
Helene, Anticancer Drug Des. 6(6): 569-84 (1991); Helene et al., Ann. N.Y. Acad. Sci. 660: 27-36 
(1992); and Maher, Bioassays 14(12): 807-15 (1992). Potential sequences that can be targeted for triple 
helix formation can be increased by creating a so-called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5 '-3', 3 '-5' manner, such that they base pair with first one 
strand of a duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 

[01 17] Osteoarthritis directed molecules include RNAi and siRNA nucleic acids. Gene expression 
may be inhibited by the introduction of double-stranded RNA (dsRNA), which induces potent and 
specific gene silencing, a phenomenon called RNA interference or RNAi. See, e.g., Fire et al., US Patent 
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Number 6,506,559; Tuschl et al. PCT International Publication No. WO 01/75164; Kay et al. PCT 
International Publication No. WO 03/01 0180A1; or Bosher JM, Labouesse, Nat Cell Biol 2000 
Feb;2(2):E31-6. This process has been improved by decreasing the size of the double-stranded RNA to 
20-24 base pairs (to create small-interfering RNAs or siRNAs) that "switched off genes in mammalian 
cells without initiating an acute phase response, i.e., a host defense mechanism that often results in cell 
death (see, e.g., Caplen et al. Proc Natl Acad Sci USA. 2001 Aug 14;98(1 7):9742-7 and Elbashir et al. 
Methods 2002 Feb;26(2): 199-2 13). There is increasing evidence of post-transcriptional gene silencing 
by RNA interference (RNAi) for inhibiting targeted expression in mammalian cells at the mRNA level, 
in human cells. There is additional evidence of effective methods for inhibiting the proliferation and 
migration of tumor cells in human patients, and for inhibiting metastatic cancer development (see, e.g., 
U.S. Patent Application No. US200 1000993 183; Caplen et al. Proc Natl Acad Sci U S A; and 
Abderrahmani et al. Mol Cell Biol 2001 Nov21(21):7256-67). 

[0118] An "siRNA" or "RNAi" refers to a nucleic acid that forms a double stranded RNA and has 
the ability to reduce or inhibit expression of a gene or target gene when the siRNA is delivered to or 
expressed in the same cell as the gene or target gene. "siRNA" refers to short double-stranded RNA 
formed by the complementary strands. Complementary portions of the siRNA that hybridize to form the 
double stranded molecule often have substantial or complete identity to the target molecule sequence. In 
one embodiment, an siRNA refers to a nucleic acid that has substantial or complete identity to a target 
gene and forms a double stranded siRNA. 

[0119] When designing the siRNA molecules, the targeted region often is selected from a given 
DNA sequence beginning 50 to 100 nucleotides downstream of the start codon. See, e.g., Elbashir et al,. 
Methods 26:199-213 (2002). Initially, 5' or 3' UTRs and regions nearby the start codon were avoided 
assuming that UTR-binding proteins and/or translation initiation complexes may interfere with binding of 
the siRNP or RISC endonuclease complex. Sometimes regions of the target 23 nucleotides in length 
conforming to the sequence motif AA(N19)TT (N, an nucleotide), and regions with approximately 30% 
to 70% G/C-content (often about 50% G/C-content) often are selected. If no suitable sequences are 
found, the search often is extended using the motif NA(N21). The sequence of the sense siRNA 
sometimes corresponds to (N19) TT or N21 (position 3 to 23 of the 23-nt motif), respectively. In the 
latter case, the 3' end of the sense siRNA often is converted to TT. The rationale for this sequence 
conversion is to generate a symmetric duplex with respect to the sequence composition of the sense and 
antisense 3' overhangs. The antisense siRNA is synthesized as the complement to position 1 to 21 of the 
23-nt motif. Because position 1 of the 23-nt motif is not recognized sequence-specifically by the 
antisense siRNA, the 3 '-most nucleotide residue of the antisense siRNA can be chosen deliberately. 
However, the penultimate nucleotide of the antisense siRNA (complementary to position 2 of the 23-nt 
motif) often is complementary to the targeted sequence. For simplifying chemical synthesis, TT often is 
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utilized. siRNAs corresponding to the target motif NAR(N 1 7)YNN, where R is purine (A,G) and Y is 
pyrimidine (C,U), often are selected. Respective 21 nucleotide sense and antisense siRNAs often begin 
with a purine nucleotide and can also be expressed from pol HI expression vectors without a change in 
targeting site. Expression of RNAs from pol HI promoters often is efficient when the first transcribed 
nucleotide is a purine. 

[0120] The sequence of the siRNA can correspond to the full length target gene, or a subsequence 
thereof. Often, the siRNA is about 15 to about 50 nucleotides in length (e.g., each complementary 
sequence of the double stranded siRNA is 15-50 nucleotides in length, and the double stranded siRNA is 
about 15-50 base pairs in length, sometimes about 20-30 nucleotides in length or about 20-25 nucleotides 
in length, e.g., 20, 21 , 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length. The siRNA sometimes 
is about 21 nucleotides in length. Methods of using siRNA are well known in the art, and specific siRNA 
molecules may be purchased from a number of companies including Dharmacon Research, Inc. 

[0121] Antisense, ribozyme, RNAi and siRNA nucleic acids can be altered to form modified nucleic 
acid molecules. The nucleic acids can be altered at base moieties, sugar moieties or phosphate backbone 
moieties to improve stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of nucleic acid molecules can be modified to generate peptide nucleic acids (see 
Hyrup et al., Bioorganic & Medicinal Chemistry 4 (1): 5-23 (1996)). As used herein, the terms "peptide 
nucleic acid" or "PNA" refers to a nucleic acid mimic such as a DNA mimic, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are 
retained. The neutral backbone of a PNA can allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. Synthesis of PNA oligomers can be performed using standard solid 
phase peptide synthesis protocols as described, for example, in Hyrup et al., (1996) supra and Perry- 
O'Keefe et al., Proc. Natl. Acad. Sci. 93: 14670-675 (1996). 

[0122] PNA nucleic acids can be used in prognostic, diagnostic, and therapeutic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, for example, inducing transcription or translation arrest or inhibiting replication. PNA 
nucleic acid molecules can also be used in the analysis of single base pair mutations in a gene, (e.g., by 
PNA-directed PCR clamping); as "artificial restriction enzymes" when used in combination with other 
enzymes, (e.g., SI nucleases (Hyrup (1996) supra)); or as probes or primers for DNA sequencing or 
hybridization (Hyrup et al., (1996) supra; Perry-O'Keefe supra). 

[0123] In other embodiments, oligonucleotides may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across cell membranes (see 
e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. 
Sci. USA 84: 648-652 (1987); PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g., 
PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
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triggered cleavage agents (See, e.g., Krol et al., Bio-Techniques 6: 958-976 (1988)) or intercalating 
agents. (See, e.g., Zon, Pharm. Res. 5: 539-549 (1988) ). To this end, the oligonucleotide may be 
conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport 
agent, or hybridization-triggered cleavage agent). 

[0124] Also included herein are molecular beacon oligonucleotide primer and probe molecules 
having one or more regions complementary to a TM7SF3 nucleotide sequence, or a substantially identical 
sequence thereof, two complementary regions one having a fluorophore and one a quencher such that the 
molecular beacon is useful for quantifying the presence of the nucleic acid in a sample. Molecular 
beacon nucleic acids are described, for example, in Lizardi et al., U.S. Patent No. 5,854,033; Nazarenko 
et al., U.S. Patent No. 5,866,336, and Livak et al., U.S. Patent 5,876,930. 

Antibodies 

[0125] The term "antibody" as used herein refers to an immunoglobulin molecule or 
immunologically active portion thereof, i.e., an antigen-binding portion. Examples of immunologically 
active portions of immunoglobulin molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as pepsin. An antibody sometimes is a 
polyclonal, monoclonal, recombinant (e.g., a chimeric or humanized), fully human, non-human (e.g., 
murine), or a single chain antibody. An antibody may have effector function and can fix complement, 
and is sometimes coupled to a toxin or imaging agent. 

[0126] A full-length polypeptide or antigenic peptide fragment encoded by a nucleotide sequence 
referenced herein can be used as an immunogen or can be used to identify antibodies made with other 
immunogens, e.g., cells, membrane preparations, and the like. An antigenic peptide often includes at 
least 8 amino acid residues of the amino acid sequences encoded by a nucleotide sequence referenced 
herein, or substantially identical sequence thereof, and encompasses an epitope. Antigenic peptides 
sometimes include 10 or more amino acids, 15 or more amino acids, 20 or more amino acids, or 30 or 
more amino acids. Hydrophilic and hydrophobic fragments of polypeptides sometimes are used as 
immunogens. 

[0127] Epitopes encompassed by the antigenic peptide are regions located on the surface of the 
polypeptide (e.g., hydrophilic regions) as well as regions with high antigenicity. For example, an Emini 
surface probability analysis of the human polypeptide sequence can be used to indicate the regions that 
have a particularly high probability of being localized to the surface of the polypeptide and are thus likely 
to constitute surface residues useful for targeting antibody production. The antibody may bind an epitope 
on any domain or region on polypeptides described herein. 

[0128] Also, chimeric, humanized, and completely human antibodies are useful for applications 
which include repeated administration to subjects. Chimeric and humanized monoclonal antibodies, 
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comprising both human and non-human portions, can be made using standard recombinant DNA 
techniques. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson et al International 
Application No. PCT/US86/02269; Akira, et al European Patent Application 184,187; Taniguchi, M., 
European Patent Application 171,496; Morrison et al European Patent Application 173,494; Neuberger et 
al PCT International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,816,567; Cabilly et al 
European Patent Application 125,023; Better et al., Science 240: 1041-1043 (1988); Liu et al., Proc. Natl. 
Acad. Sci. USA 84: 3439-3443 (1987); Liu et al., J. Immunol. 139: 3521-3526 (1987); Sun et al., Proc. 
Natl. Acad. Sci. USA 84: 214-218 (1987); Nishimura et al., Cane. Res. 47: 999-1005 (1987); Wood et al., 
Nature 314: 446-449 (1985); and Shaw et al., J. Natl. Cancer Inst. 80: 1553-1559 (1988); Morrison, S. 
L., Science 229: 1202-1207 (1985); Oi et al., BioTechniques 4: 214 (1986); Winter U.S. Patent 
5,225,539; Jones et al., Nature 321: 552-525 (1986); Verhoeyan et al., Science 239: 1534; and Beidler et 
al., J. Immunol. 141 : 4053-4060 (1988). 

[0129] Completely human antibodies are particularly desirable for therapeutic treatment of human 
patients. Such antibodies can be produced using transgenic mice that are incapable of expressing 
endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light 
chain genes. See, for example, Lonberg and Huszar, Int. Rev. Immunol. 13: 65-93 (1995); and U.S. 
Patent Nos. 5,625,126; 5,633,425; 5,569,825; 5,661,016; and 5,545,806. In addition, companies such as 
Abgenix, Inc. (Fremont, CA) and Medarex, Inc. (Princeton, NJ), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described above. 
Completely human antibodies that recognize a selected epitope also can be generated using a technique 
referred to as "guided selection." In this approach a selected non-human monoclonal antibody (e.g., a 
murine antibody) is used to guide the selection of a completely human antibody recognizing the same 
epitope. This technology is described for example by Jespers et al., Bio/Technology 12: 899-903 (1994). 

[0130] An antibody can be a single chain antibody. A single chain antibody (scFV) can be 
engineered (see, e.g., Colcher et al., Ann. N Y Acad. Sci. 880: 263-80 (1999); and Reiter, Clin. Cancer 
Res. 2: 245-52 (1996)). Single chain antibodies can be dimerized or multimerized to generate 
multivalent antibodies having specificities for different epitopes of the same target polypeptide. 

[0131] Antibodies also may be selected or modified so that they exhibit reduced or no ability to bind 
an Fc receptor. For example, an antibody may be an isotype or subtype, fragment or other mutant, which 
does not support binding to an Fc receptor (e.g., it has a mutagenized or deleted Fc receptor binding 
region). 

[0132] Also, an antibody (or fragment thereof) may be conjugated to a therapeutic moiety such as a 
cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any 
agent that is detrimental to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium 
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bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and 
analogs or homologs thereof. Therapeutic agents include, but are not limited to, antimetabolites (e.g., 
methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents 
(e.g., mecnJorethamine, thiotepa chlorambucil, melphalan, carmustine (BCNU) and lomustine (CCNU), 
cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine 
platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, mithramycin, and 
anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine). 

[0133] Antibody conjugates can be used for modifying a given biological response. For example, 
the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins 
may include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a 
polypeptide such as tumor necrosis factor, gamma-interferon, alpha-interferon, nerve growth factor, 
platelet derived growth factor, tissue plasminogen activator; or, biological response modifiers such as, for 
example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte 
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or 
other growth factors. Also, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, for example. 

[0134] An antibody (e.g., monoclonal antibody) can be used to isolate target polypeptides by 
standard techniques, such as affinity chromatography or immunoprecipitation. Moreover, an antibody 
can be used to detect a target polypeptide (e.g., in a cellular lysate or cell supernatant) in order to evaluate 
the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor polypeptide levels in tissue as part of a clinical testing procedure, e.g., to determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the 
antibody to a detectable substance (i.e., antibody labeling). Examples of detectable substances include 
various enzymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, B-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include 125 I, 131 1, 35 S or 3 H. Also, an antibody can be utilized as a test 
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molecule for determining whether it can treat osteoarthritis, and as a therapeutic for administration to a 
subject for treating osteoarthritis. 

[0135] An antibody can be made by immunizing with a purified antigen, or a fragment thereof, e.g., 
a fragment described herein, a membrane associated antigen, tissues, e.g., crude tissue preparations, 
whole cells, preferably living cells, lysed cells, or cell fractions. 

[0136] Included herein are antibodies which bind only a native polypeptide, only denatured or 
otherwise non-native polypeptide, or which bind both, as well as those having linear or conformational 
epitopes. Conformational epitopes sometimes can be identified by selecting antibodies that bind to 
native but not denatured polypeptide. Also featured are antibodies that specifically bind to a polypeptide 
variant associated with osteoarthritis. 

Methods for Identifying Candidate Therapeutics for Treating Osteoarthritis 

[0137] Current therapies for the treatment of osteoarthritis have limited efficacy, limited tolerability 
and significant mechanism-based side effects, and few of the available therapies adequately address 
underlying defects. Current therapeutic approaches were largely developed in the absence of defined 
molecular targets or even a solid understanding of disease pathogenesis. Therefore, provided are 
methods of identifying candidate therapeutics that target biochemical pathways related to the 
development of osteoarthritis. 

[0138] Thus, featured herein are methods for identifying a candidate therapeutic for treating 
osteoarthritis. The methods comprise contacting a test molecule with a target molecule in a system. A 
"target molecule" as used herein refers to a TM7SF3 nucleic acid, a substantially identical nucleic acid 
thereof, or a fragment thereof, and an encoded polypeptide of the foregoing. The methods also comprise 
determining the presence or absence of an interaction between the test molecule and the target molecule, 
where the presence of an interaction between the test molecule and the nucleic acid or polypeptide 
identifies the test molecule as a candidate osteoarthritis therapeutic. The interaction between the test 
molecule and the target molecule may be quantified. 

[0139] Test molecules and candidate therapeutics include, but are not limited to, compounds, 
antisense nucleic acids, siRNA molecules, ribozymes, polypeptides or proteins encoded by a TM7SF3 
nucleotide sequence, or a substantially identical sequence or fragment thereof, and immunotherapeutics 
(e.g., antibodies and HLA-presented polypeptide fragments). A test molecule or candidate therapeutic 
may act as a modulator of target molecule concentration or target molecule function in a system. A 
"modulator" may agonize (i.e., up-regulates) or antagonize (i.e., down-regulates) a target molecule 
concentration partially or completely in a system by affecting such cellular functions as DNA replication 
and/or DNA processing (e.g., DNA methylation or DNA repair), RNA transcription and/or RNA 
processing (e.g., removal of intronic sequences and/or translocation of spliced mRNA from the nucleus), 
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polypeptide production (e.g., translation of the polypeptide from mRNA), and/or polypeptide post- 
translational modification (e.g., glycosylation, phosphorylation, and proteolysis of pro-polypeptides). A 
modulator may also agonize or antagonize a biological function of a target molecule partially or 
completely, where the function may include adopting a certain structural conformation, interacting with 
one or more binding partners, ligand binding, catalysis (e.g., phosphorylation, dephosphorylation, 
hydrolysis, methylation, and isomerization), and an effect upon a cellular event (e.g., effecting 
progression of osteoarthritis). 

[0140] As used herein, the term "system" refers to a cell free in vitro environment and a cell-based 
environment such as a collection of cells, a tissue, an organ, or an organism. A system is "contacted" 
with a test molecule in a variety of manners, including adding molecules in solution and allowing them to 
interact with one another by diffusion, cell injection, and any administration routes in an animal. As used 
herein, the term "interaction" refers to an effect of a test molecule on test molecule, where the effect 
sometimes is binding between the test molecule and the target molecule, and sometimes is an observable 
change in cells, tissue, or organism. 

[0141] There are many standard methods for detecting the presence or absence of interaction 
between a test molecule and a target molecule. For example, titrametric, acidimetric, radiometric, NMR, 
monolayer, polarographic, spectre-photometric, fluorescent, and ESR assays probative of a target 
molecule interaction may be utilized. 

[0142] Test molecule/target molecule interactions can be detected and/or quantified using assays 
known in the art. For example, an interaction can be determined by labeling the test molecule and/or the 
target molecule, where the label is covalently or non-covalently attached to the test molecule or target 
molecule. The label is sometimes a radioactive molecule such as l25 I, m I, 35 S or 3 H, which can be 
detected by direct counting of radioemission or by scintillation counting. Also, enzymatic labels such as 
horseradish peroxidase, alkaline phosphatase, or luciferase may be utilized where the enzymatic label can 
be detected by determining conversion of an appropriate substrate to product. In addition, presence or 
absence of an interaction can be determined without labeling. For example, a microphysiometer (e.g. , 
Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its environment 
using a light-addressable potentiometric sensor (LAPS). Changes in this acidification rate can be used as 
an indication of an interaction between a test molecule and target molecule (McConnell, H. M. et ah, 
Science 257: 1906-1912 (1992)). 

[0143] In cell-based systems, cells typically include a TM7SF3 nucleic acid, an encoded 
polypeptide, or substantially identical nucleic acid or polypeptide thereof, and are often of mammalian 
origin, although the cell can be of any origin. Whole cells, cell homogenates, and cell fractions (e.g., cell 
membrane fractions) can be subjected to analysis. Where interactions between a test molecule with a 
target polypeptide are monitored, soluble and/or membrane bound forms of the polypeptide may be 
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utilized. Where membrane-bound forms of the polypeptide are used, it may be desirable to utilize a 
solubilizing agent. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 

methylglucamide, Triton® X- 100, Triton® X-l 14, Thesit®, Isotridecypoly( ethylene glycol ether) n , 3- 

[(3-cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 

cholarrudopropy l)dimethylamminio] -2 -hydroxy- 1 -propane sulfonate (CHAPSO), or N-dodecyl-N,N- 
dimethyl-3-ammonio-l -propane sulfonate. 

[0144] An interaction between a test molecule and target molecule also can be detected by 
monitoring fluorescence energy transfer (FET) (see, e.g., Lakowicz et al., U.S. Patent No. 5,631,169; 
Stavrianopoulos et al. U.S. Patent No. 4,868,103). A fluorophore label on a first, "donor" molecule is 
selected such that its emitted fluorescent energy will be absorbed by a fluorescent label on a second, 
"acceptor" molecule, which in turn is able to fluoresce due to the absorbed energy. Alternately, the 
"donor" polypeptide molecule may simply utilize the natural fluorescent energy of tryptophan residues. 
Labels are chosen that emit different wavelengths of light, such that the "acceptor" molecule label may 
be differentiated from that of the "donor". Since the efficiency of energy transfer between the labels is 
related to the distance separating the molecules, the spatial relationship between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the 
"acceptor" molecule label in the assay should be maximal. An FET binding event can be conveniently 
measured through standard fluorometric detection means well known in the art (e.g., using a fluorimeter). 

[0145] In another embodiment, determining the presence or absence of an interaction between a test 
molecule and a target molecule can be effected by monitoring surface plasmon resonance (see, e.g. , 
Sjolander & Urbaniczk, Anal. Chem. 63: 2338-2345 (1991) and Szabo et al., Curr. Opin. Struct. Biol. 5: 
699-705 (1995)). "Surface plasmon resonance" or "biomolecular interaction analysis (BIA)" can be 
utilized to detect biospecific interactions in real time, without labeling any of the interactants (e.g., 
BIAcore). Changes in the mass at the binding surface (indicative of a binding event) result in alterations 
of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance 
(SPR)), resulting in a detectable signal which can be used as an indication of real-time reactions between 
biological molecules. 

[0146] In another embodiment, the target molecule or test molecules are anchored to a solid phase, 
facilitating the detection of target molecule/test molecule complexes and separation of the complexes 
from free, uncomplexed molecules. The target molecule or test molecule is immobilized to the solid 
support. In an embodiment, the target molecule is anchored to a solid surface, and the test molecule, 
which is not anchored, can be labeled, either directly or indirectly, with detectable labels discussed 
herein. 



41 



PATENT 
Docket 524593009100 



[0147] It may be desirable to immobilize a target molecule, an anti-target molecule antibody, and/or 
test molecules to facilitate separation of target molecule/test molecule complexes from uncomplexed 
forms, as well as to accommodate automation of the assay. The attachment between a test molecule 
and/or target molecule and the solid support may be covalent or non-covalent (see, e.g., U.S. Patent No. 
6,022,688 for non-covalent attachments). The solid support may be one or more surfaces of the system, 
such as one or more surfaces in each well of a microtiter plate, a surface of a silicon wafer, a surface of a 
bead (see, e.g., Lam, Nature 354: 82-84 (1991)) that is optionally linked to another solid support, or a 
channel in a microfluidic device, for example. Types of solid supports, linker molecules for covalent and 
non-covalent attachments to solid supports, and methods for immobilizing nucleic acids and other 
molecules to solid supports are well known (see, e.g., U.S. Patent Nos. 6,261,776; 5,900,481; 6,133,436; 
and 6,022,688; and WIPO publication WO 01/18234). 

[0148] In an embodiment, target molecule may be immobilized to surfaces via biotin and 
streptavidin. For example, biotinylated target polypeptide can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), 
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In another 
embodiment, a target polypeptide can be prepared as a fusion polypeptide. For example, glutathione-S- 
transferase/target polypeptide fusion can be adsorbed onto glutathione sepharose beads (Sigma Chemical, 
St. Louis, MO) or glutathione derivitized microtiter plates, which are then combined with a test molecule 
under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). 
Following incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, or the matrix is immobilized in the case of beads, and complex formation is determined 
directly or indirectly as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of target molecule binding or activity is determined using standard techniques. 

[0149] In an embodiment, the non-immobilized component is added to the coated surface containing 
the anchored component. After the reaction is complete, unreacted components are removed (e.g., by 
washing) under conditions such that a significant percentage of complexes formed will remain 
immobilized to the solid surface. The detection of complexes anchored on the solid surface can be 
accomplished in a number of manners. Where the previously non-immobilized component is pre-labeled, 
the detection of label immobilized on the surface indicates that complexes were formed. Where the 
previously non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface, e.g. , by adding a labeled antibody specific for the immobilized 
component, where the antibody, in turn, can be directly labeled or indirectly labeled with, e.g. , a labeled 
anti-Ig antibody. 

[0150] In another embodiment, an assay is performed utilizing antibodies that specifically bind 
target molecule or test molecule but do not interfere with binding of the target molecule to the test 
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molecule. Such antibodies can be derivitized to a solid support, and unbound target molecule may be 
immobilized by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the target molecule, as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the target molecule. 

[0151] Cell free assays also can be conducted in a liquid phase. In such an assay, reaction products 
are separated from unreacted components, by any of a number of standard techniques, including but not 
limited to: differential centrifugation (see, e.g., Rivas, G., and Minton, Trends Biochem Set Aug; 1 8(8): 
284-7 (1993)); chromatography (gel filtration chromatography, ion-exchange chromatography); 
electrophoresis (see, e.g., Ausubel et al., eds. Current Protocols in Molecular Biology , J. Wiley: New 
York (1999)); and immunoprecipitation (see, e.g., Ausubel et al., eds., supra). Media and 
chromatographic techniques are known to one skilled in the art (see, e.g., Heegaard, J Mol. Recognit. 
Winter; 11(1-6): 141-8 (1998); Hage & Tweed, J. Chromatogr. B Biomed. Sci. Appl. Oct 10; 699 (1-2): 
499-525 (1997)). Further, fluorescence energy transfer may also be conveniently utilized, as described 
herein, to detect binding without further purification of the complex from solution. 

[0152] In another embodiment, modulators of target molecule expression are identified. For 
example, a cell or cell free mixture is contacted with a candidate compound and the expression of target 
mRNA or target polypeptide is evaluated relative to the level of expression of target mRNA or target 
polypeptide in the absence of the candidate compound. When expression of target mRNA or target 
polypeptide is greater in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an agonist of target mRNA or target polypeptide expression. Alternatively, 
when expression of target mRNA or target polypeptide is less (e.g., less with statistical significance) in 
the presence of the candidate compound than in its absence, the candidate compound is identified as an 
antagonist or inhibitor of target mRNA or target polypeptide expression. The level of target mRNA or 
target polypeptide expression can be determined by methods described herein. 

[0153] In another embodiment, binding partners that interact with a target molecule are detected. 
The target molecules can interact with one or more cellular or extracellular macromolecules, such as 
polypeptides in vivo, and these interacting molecules are referred to herein as "binding partners." 
Binding partners can agonize or antagonize target molecule biological activity. Also, test molecules that 
agonize or antagonize interactions between target molecules and binding partners can be useful as 
therapeutic molecules as they can up-regulate or down-regulated target molecule activity in vivo and 
thereby treat osteoarthritis. 

[0154] Binding partners of target molecules can be identified by methods known in the art. For 
example, binding partners may be identified by lysing cells and analyzing cell lysates by electrophoretic 
techniques. Alternatively, a two-hybrid assay or three-hybrid assay can be utilized (see, e.g., U.S. Patent 
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No. 5,283,317; Zervos et al, Cell 72:223-232 (1993); Madura et al., J. Biol. Chem. 268: 12046-12054 
(1993); Bartel et al., Biotechniques 14: 920-924 (1993); Iwabuchi et al., Oncogene 8: 1693-1696 (1993); 
and Brent WO94/10300). A two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. The assay often utilizes two 
different DNA constructs. In one construct, a TM7SF3 nucleic acid (sometimes referred to as the "bait") 
is fused to a gene encoding the DNA binding domain of a known transcription factor {e.g., GAL-4). In 
another construct, a DNA sequence from a library of DNA sequences that encodes a potential binding 
partner (sometimes referred to as the "prey") is fused to a gene that encodes an activation domain of the 
known transcription factor. Sometimes, a TM7SF3 nucleic acid can be fused to the activation domain. If 
the "bait" and the "prey" molecules interact in vivo, the DNA-binding and activation domains of the 
transcription factor are brought into close proximity. This proximity allows transcription of a reporter 
gene {e.g. , LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to identify the potential binding partner. 

[0155] In an embodiment for identifying test molecules that antagonize or agonize complex 
formation between target molecules and binding partners, a reaction mixture containing the target 
molecule and the binding partner is prepared, under conditions and for a time sufficient to allow complex 
formation. The reaction mixture often is provided in the presence or absence of the test molecule. The 
test molecule can be included initially in the reaction mixture, or can be added at a time subsequent to the 
addition of the target molecule and its binding partner. Control reaction mixtures are incubated without 
the test molecule or with a placebo. Formation of any complexes between the target molecule and the 
binding partner then is detected. Decreased formation of a complex in the reaction mixture containing 
test molecule as compared to in a control reaction mixture indicates that the molecule antagonizes target 
molecule/binding partner complex formation. Alternatively, increased formation of a complex in the 
reaction mixture containing test molecule as compared to in a control reaction mixture indicates that the 
molecule agonizes target molecule/binding partner complex formation. In another embodiment, complex 
formation of target molecule/binding partner can be compared to complex formation of mutant target 
molecule/binding partner {e.g., amino acid modifications in a target polypeptide). Such a comparison can 
be important in those cases where it is desirable to identify test molecules that modulate interactions of 
mutant but not non-mutated target gene products. 

[0156] The assays can be conducted in a heterogeneous or homogeneous format. In heterogeneous 
assays, target molecule and/or the binding partner are immobilized to a solid phase, and complexes are 
detected on the solid phase at the end of the reaction. In homogeneous assays, the entire reaction is 
carried out in a liquid phase. In either approach, the order of addition of reactants can be varied to obtain 

different information about the molecules being tested. For example, test compounds that agonize target 
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molecule/binding partner interactions can be identified by conducting the reaction in the presence of the 
test molecule in a competition format. Alternatively, test molecules that agonize preformed complexes, 
e.g., molecules with higher binding constants that displace one of the components from the complex, can 
be tested by adding the test compound to the reaction mixture after complexes have been formed. 

[0157] In a heterogeneous assay embodiment, the target molecule or the binding partner is anchored 
onto a solid surface (e.g., a microtiter plate), while the non-anchored species is labeled, either directly or 
indirectly. The anchored molecule can be immobilized by non-covalent or covalent attachments. 
Alternatively, an immobilized antibody specific for the molecule to be anchored can be used to anchor 
the molecule to the solid surface. The partner of the immobilized species is exposed to the coated surface 
with or without the test molecule. After the reaction is complete, unreacted components are removed 
(e.g., by washing) such that a significant portion of any complexes formed will remain immobilized on 
the solid surface. Where the non-immobilized species is pre-labeled, the detection of label immobilized 
on the surface is indicative of complex. Where the non-immobilized species is not pre-labeled, an 
indirect label can be used to detect complexes anchored to the surface; e.g., by using a labeled antibody 
specific for the initially non-immobilized species. Depending upon the order of addition of reaction 
components, test compounds that inhibit complex formation or that disrupt preformed complexes can be 
detected. 

[0158] In another embodiment, the reaction can be conducted in a liquid phase in the presence or 
absence of test molecule, where the reaction products are separated from unreacted components, and the 
complexes are detected (e.g. , using an immobilized antibody specific for one of the binding components 
to anchor any complexes formed in solution, and a labeled antibody specific for the other partner to 
detect anchored complexes). Again, depending upon the order of addition of reactants to the liquid 
phase, test compounds that inhibit complex or that disrupt preformed complexes can be identified. 

[0159] In an alternate embodiment, a homogeneous assay can be utilized. For example, a preformed 
complex of the target gene product and the interactive cellular or extracellular binding partner product is 
prepared. One or both of the target molecule or binding partner is labeled, and the signal generated by 
the label(s) is quenched upon complex formation (e.g., U.S. Patent No. 4,109,496 that utilizes this 
approach for immunoassays). Addition of a test molecule that competes with and displaces one of the 
species from the preformed complex will result in the generation of a signal above background. In this 
way, test substances that disrupt target molecule/binding partner complexes can be identified. 

[0160] Candidate therapeutics for treating osteoarthritis are identified from a group of test molecules 
that interact with a target molecule. Test molecules are normally ranked according to the degree with 
which they modulate (e.g. , agonize or antagonize) a function associated with the target molecule (e.g. , 
DNA replication and/or processing, RNA transcription and/or processing, polypeptide production and/or 
processing, and/or biological function/activity), and then top ranking modulators are selected. Also, 
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pharmacogenomic information described herein can determine the rank of a modulator. The top 10% of 
ranked test molecules often are selected for further testing as candidate therapeutics, and sometimes the 
top 15%, 20%, or 25% of ranked test molecules are selected for further testing as candidate therapeutics. 
Candidate therapeutics typically are formulated for administration to a subject. 

Therapeutic Formulations 

[0161] Formulations and pharmaceutical compositions typically include in combination with a 
pharmaceutically acceptable carrier one or more target molecule modulators. The modulator often is a 
test molecule identified as having an interaction with a target molecule by a screening method described 
above. The modulator may be a compound, an antisense nucleic acid, a ribozyme, an antibody, or a 
binding partner. Also, formulations may comprise a target polypeptide or fragment thereof in 
combination with a pharmaceutically acceptable carrier. 

[0162] As used herein, the term "pharmaceutically acceptable carrier" includes solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Supplementary active compounds can also be 
incorporated into the compositions. Pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

[0163] A pharmaceutical composition typically is formulated to be compatible with its intended 
route of administration. Examples of routes of administration include parenteral, e.g., intravenous, 
intradermal, subcutaneous, oral {e.g., inhalation), transdermal (topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile diluent such as water for injection, saline solution, fixed 
oils, polyethylene glycols, glycerin, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with 
acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

[0164] Oral compositions generally include an inert diluent or an edible carrier. For the purpose of 
oral therapeutic administration, the active compound can be incorporated with excipients and used in the 
form of tablets, troches, or capsules, e.g. , gelatin capsules. Oral compositions can also be prepared using 
a fluid carrier for use as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
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disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate 
or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0165] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline 
(PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It should be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of 
the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in 
the composition. Prolonged absorption of the injectable compositions can be brought about by including 
in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin. 

[0166] Sterile injectable solutions can be prepared by incorporating the active compound in the 
required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the 
active compound into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying which 
yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile- 
filtered solution thereof. 

[0167] For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g. , a gas such as 
carbon dioxide, or a nebulizer. 

[0168] Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are 
used in the formulation. Such penetrants are generally known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or suppositories. For transdermal 
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administration, the active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. Molecules can also be prepared in the form of suppositories (e.g. , with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

[0169] In one embodiment, active molecules are prepared with carriers that will protect the 
compound against rapid elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, 
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and 
polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. 
Materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. Patent 
No. 4,522,811. 

[0170] It is advantageous to formulate oral or parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete 
units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity 
of active compound calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[0171] Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD 50 (the 
dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 50% of the 
population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD 5 o/ED 5 o. Molecules which exhibit high therapeutic indices are preferred. While 
molecules that exhibit toxic side effects may be used, care should be taken to design a delivery system 
that targets such compounds to the site of affected tissue in order to minimize potential damage to 
uninfected cells and, thereby, reduce side effects. 

[0172] The data obtained from the cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such molecules lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
molecules used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 50 (i.e., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
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used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

[0173] As defined herein, a therapeutically effective amount of protein or polypeptide (i.e., an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, sometimes about 0.01 to 25 mg/kg 
body weight, often about 0.1 to 20 mg/kg body weight, and more often about 1 to 10 mg/kg, 2 to 9 
mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 mg/kg body weight. The protein or polypeptide can be 
administered one time per week for between about 1 to 1 0 weeks, sometimes between 2 to 8 weeks, often 
between about 3 to 7 weeks, and more often for about 4, 5, or 6 weeks. The skilled artisan will 
appreciate that certain factors may influence the dosage and timing required to effectively treat a subject, 
including but not limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a 
therapeutically effective amount of a protein, polypeptide, or antibody can include a single treatment or, 
preferably, can include a series of treatments. 

[0174] With regard to polypeptide formulations, featured herein is a method for treating 
osteoarthritis in a subject, which comprises contacting one or more cells in the subject with a first 
polypeptide, where the subject comprises a second polypeptide having one or more polymorphic 
variations associated with cancer, and where the first polypeptide comprises fewer polymorphic 
variations associated with cancer than the second polypeptide. The first and second polypeptides are 
encoded by a nucleic acid which comprises a nucleotide sequence in SEQ ID NO: 1-2; a nucleotide 
sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a nucleotide 
sequence referenced in SEQ ID NO: 1-2; a nucleotide sequence which encodes a polypeptide that is 90% 
or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1 -2 and a 
nucleotide sequence 90% or more identical to a nucleotide sequence in SEQ ID NO: 1 -2. The subject 
often is a human. 

[0175] For antibodies, a dosage of 0.1 mg/kg of body weight (generally 10 mg/kg to 20 mg/kg) is 
often utilized. If the antibody is to act in the brain, a dosage of 50 mg/kg to 1 00 mg/kg is often 
appropriate. Generally, partially human antibodies and fully human antibodies have a longer half-life 
within the human body than other antibodies. Accordingly, lower dosages and less frequent 
administration is often possible. Modifications such as lipidation can be used to stabilize antibodies and 
to enhance uptake and tissue penetration (e.g., into the brain). A method for lipidation of antibodies is 
described by Cruikshank et ai, J. Acquired Immune Deficiency Syndromes and Human Retrovirology 
1 4.193 (1997). 

[0176] Antibody conjugates can be used for modifying a given biological response, the drug moiety 
is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety 
may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for 
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example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a polypeptide such as 
tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response modifiers such as, for example, lymphokines, 
interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte macrophage colony 
stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or other growth 
factors. Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

[0177] For compounds, exemplary doses include milligram or microgram amounts of the compound 
per kilogram of subject or sample weight, for example, about 1 microgram per kilogram to about 500 
milligrams per kilogram, about 100 micrograms per kilogram to about 5 milligrams per kilogram, or 
about 1 microgram per kilogram to about 50 micrograms per kilogram. It is understood that appropriate 
doses of a small molecule depend upon the potency of the small molecule with respect to the expression 
or activity to be modulated. When one or more of these small molecules is to be administered to an 
animal (e.g. , a human) in order to modulate expression or activity of a polypeptide or nucleic acid 
described herein, a physician, veterinarian, or researcher may, for example, prescribe a relatively low 
dose at first, subsequently increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend upon a variety of 
factors including the activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

[0178] With regard to nucleic acid formulations, gene therapy vectors can be delivered to a subject 
by, for example, intravenous injection, local administration (see, e.g., U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen et al, (1994) Proc. Natl. Acad. Sci. USA P/:3054-3057). 
Pharmaceutical preparations of gene therapy vectors can include a gene therapy vector in an acceptable 
diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells 
(e.g. , retroviral vectors) the pharmaceutical preparation can include one or more cells which produce the 
gene delivery system. Examples of gene delivery vectors are described herein. 

Therapeutic Methods 

[0179] A therapeutic formulation described above can be administered to a subject in need of a 
therapeutic for inducing a desired biological response.. Therapeutic formulations can be administered by 
any of the paths described herein. With regard to both prophylactic and therapeutic methods of 
treatment, such treatments may be specifically tailored or modified, based on knowledge obtained from 
pharmacogenomic analyses described herein. 
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[0180] As used herein, the term "treatment" is defined as the application or administration of a 
therapeutic formulation to a subject, or application or administration of a therapeutic agent to an isolated 
tissue or cell line from a subject with the purpose to cure, heal, alleviate, relieve, alter, remedy, 
ameliorate, improve or affect osteoarthritis, symptoms of osteoarthritis or a predisposition towards 
osteoarthritis. A therapeutic formulation includes, but is not limited to, small molecules, peptides, 
antibodies, ribozymes and antisense oligonucleotides. Administration of a therapeutic formulation can 
occur prior to the manifestation of symptoms characteristic of osteoarthritis, such that osteoarthritis is 
prevented or delayed in its progression. The appropriate therapeutic composition can be determined 
based on screening assays described herein. 

[0181] As discussed, successful treatment of osteoarthritis can be brought about by techniques that 
serve to agonize target molecule expression or function, or alternatively, antagonize target molecule 
expression or function. These techniques include administration of modulators that include, but are not 
limited to, small organic or inorganic molecules; antibodies (including, for example, polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric or single chain antibodies, and Fab, F(ab')2 and Fab 
expression library fragments, scFV molecules, and epitope-binding fragments thereof); and peptides, 
phosphopeptides, or polypeptides. 

[0182] Further, antisense and ribozyme molecules that inhibit expression of the target gene can also 
be used to reduce the level of target gene expression, thus effectively reducing the level of target gene 
activity. Still further, triple helix molecules can be utilized in reducing the level of target gene activity. 
Antisense, ribozyme and triple helix molecules are discussed above. It is possible that the use of 
antisense, ribozyme, and/or triple helix molecules to reduce or inhibit mutant gene expression can also 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by normal target gene alleles, such that the concentration of normal target gene product present 
can be lower than is necessary for a normal phenotype. In such cases, nucleic acid molecules that encode 
and express target gene polypeptides exhibiting normal target gene activity can be introduced into cells 
via gene therapy method. Alternatively, in instances in that the target gene encodes an extracellular 
polypeptide, it can be preferable to co-administer normal target gene polypeptide into the cell or tissue in 
order to maintain the requisite level of cellular or tissue target gene activity. 

[0183] Another method by which nucleic acid molecules may be utilized in treating or preventing 
osteoarthritis is use of aptamer molecules specific for target molecules. Aptamers are nucleic acid 
molecules having a tertiary structure which permits them to specifically bind to ligands (see, e.g., 
Osborne, et al, Curr. Opin. Chem. Biol. 1(1): 5-9 (1997); and Patel, D. J., Curr. Opin. Chem. Biol. 
Jun;l(l): 32-46(1997)). 

[0184] Yet another method of utilizing nucleic acid molecules for osteoarthritis treatment is gene 
therapy, which can also be referred to as allele therapy. Provided herein is a gene therapy method for 

51 



PATENT 
Docket 524593009100 



treating osteoarthritis in a subject, which comprises contacting one or more cells in the subject or from 
the subject with a nucleic acid having a first nucleotide sequence (e.g., the first nucleotide sequence is 
identical to or substantially identical to a nucleotide sequence of SEQ ID NO: 1-2). Genomic DNA in 
the subject comprises a second nucleotide sequence having one or more polymorphic variations 
associated with osteoarthritis (e.g., the second nucleotide sequence is identical to or substantially 
identical to a nucleotide sequence of SEQ ID NO: 1 ). The first and second nucleotide sequences 
typically are substantially identical to one another, and the first nucleotide sequence comprises fewer 
polymorphic variations associated with osteoarthritis than the second nucleotide sequence. The first 
nucleotide sequence may comprise a gene sequence that encodes a full-length polypeptide or a fragment 
thereof. The subject is often a human. Allele therapy methods often are utilized in conjunction with a 
method of first determining whether a subject has genomic DNA that includes polymorphic variants 
associated with osteoarthritis. 

[0185] In another allele therapy embodiment, provided herein is a method which comprises 
contacting one or more cells in the subject or from the subject with a polypeptide encoded by a nucleic 
acid having a first nucleotide sequence (e.g., the first nucleotide sequence is identical to or substantially 
identical to the nucleotide sequence of SEQ ID NO: 1-2). Genomic DNA in the subject comprises a 
second nucleotide sequence having one or more polymorphic variations associated with osteoarthritis 
(e.g., the second nucleotide sequence is identical to or substantially identical to a nucleotide sequence of 
SEQ ID NO: 1). The first and second nucleotide sequences typically are substantially identical to one 
another, and the first nucleotide sequence comprises fewer polymorphic variations associated with 
osteoarthritis than the second nucleotide sequence. The first nucleotide sequence may comprise a gene 
sequence that encodes a full-length polypeptide or a fragment thereof. The subject is often a human. 

[0186] For antibody-based therapies, antibodies can be generated that are both specific for target 
molecules and that reduce target molecule activity. Such antibodies may be administered in instances 
where antagonizing a target molecule function is appropriate for the treatment of osteoarthritis. 

[0187] In circumstances where stimulating antibody production in an animal or a human subject by 
injection with a target molecule is harmful to the subject, it is possible to generate an immune response 
against the target molecule by use of anti-idiotypic antibodies {see, e.g., Herlyn, Ann. Med.; 3 1(1): 66-78 
(1999); and Bhattacharya-Chatterjee & Foon, Cancer Treat. Res.; 94: 51-68 (1998)). Introducing an 
anti-idiotypic antibody to a mammal or human subject often stimulates production of anti-anti-idiotypic 
antibodies, which typically are specific to the target molecule. Vaccines directed to osteoarthritis also 
may be generated in this fashion. 

[0188] In instances where the target molecule is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. Lipofectin or liposomes can be used to deliver the antibody or 
a fragment of the Fab region that binds to the target antigen into cells. Where fragments of the antibody 
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are used, the smallest inhibitory fragment that binds to the target antigen is preferred. For example, 
peptides having an amino acid sequence corresponding to the Fv region of the antibody can be used. 
Alternatively, single chain neutralizing antibodies that bind to intracellular target antigens can also be 
administered. Such single chain antibodies can be administered, for example, by expressing nucleotide 
sequences encoding single-chain antibodies within the target cell population (see, e.g., Marasco et al., 
Proc. Natl. Acad. Sci. USA 90: 7889-7893 (1993)). 

[0189] Modulators can be administered to a patient at therapeutically effective doses to treat 
osteoarthritis. A therapeutically effective dose refers to an amount of the modulator sufficient to result in 
amelioration of symptoms of osteoarthritis. Toxicity and therapeutic efficacy of modulators can be 
detennined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g., for 
determining the LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD50/ED50. Modulators that exhibit large 
therapeutic indices are preferred. While modulators that exhibit toxic side effects can be used, care 
should be taken to design a delivery system that targets such molecules to the site of affected tissue in 
order to minimize potential damage to uninfected cells, thereby reducing side effects. 

[0190] Data obtained from cell culture assays and animal studies can be used in formulating a range 
of dosages for use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage can vary within this 
range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose can be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 50 (i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma can be measured, for 
example, by high performance liquid chromatography. 

[0191] Another example of effective dose determination for an individual is the ability to directly 
assay levels of "free" and "bound" compound in the serum of the test subject. Such assays may utilize 
antibody mimics and/or "biosensors" that have been created through molecular imprinting techniques. 
Molecules that modulate target molecule activity are used as a template, or "imprinting molecule", to 
spatially organize polymerizable monomers prior to their polymerization with catalytic reagents. The 
subsequent removal of the imprinted molecule leaves a polymer matrix which contains a repeated 
"negative image" of the compound and is able to selectively rebind the molecule under biological assay 
conditions. A detailed review of this technique can be seen in Ansell et al., Current Opinion in 
Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2: 166-173 (1994). Such 

53 



PATENT 
Docket 524593009100 



"imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the immobilized 
monoclonal antibody component is replaced by an appropriately imprinted matrix. An example of the 
use of such matrixes in this way can be seen in Vlatakis, et al.. Nature 361: 645-647 (1 993). Through the 
use of isotope-labeling, the "free" concentration of compound which modulates target molecule 
expression or activity readily can be monitored and used in calculations of IC 50 - Such "imprinted" 
affinity matrixes can also be designed to include fluorescent groups whose photon-emitting properties 
measurably change upon local and selective binding of target compound. These changes readily can be 
assayed in real time using appropriate fiberoptic devices, in turn allowing the dose in a test subject to be 
quickly optimized based on its individual IC 50 . An example of such a "biosensor" is discussed in Kriz et 
al., Analytical Chemistry 67: 2142-2144 (1995). 

[0192] The examples set forth below are intended to illustrate but not limit the invention. 

Examples 

[0193] In the following studies a group of subjects was selected according to specific parameters 
relating to osteoarthritis (OA). Nucleic acid samples obtained from individuals in the study group were 
subjected to genetic analysis, which identified associations between osteoarthritis and a polymorphism in 
the TM7SF3 gene on chromosome twelve. The polymorphism was genotyped again in two replication 
cohorts consisting of individuals selected for OA. In addition, SNPs proximal to the incident 
polymorphism were identified and allelotyped in OA case and control pools. Methods are described for 
producing TM7SF3 polypeptide and TM7SF3 polypeptide variants in vitro or in vivo, TM7SF3 nucleic 
acids or polypeptides and variants thereof are utilized for screening test molecules for those that interact 
with TM7SF3 molecules. Test molecules identified as interactors with TM7SF3 molecules and TM7SF3 
variants are further screened in vivo to determine whether they treat osteoarthritis. 

Example 1 
Samples and Pooling Strategies 

Sample Selection 

[0194] Blood samples were collected from individuals diagnosed with knee osteoarthritis, which 
were referred to as case samples. Also, blood samples were collected from individuals not diagnosed 
with knee osteoarthritis as gender and age-matched controls. A database was created that listed all 
phenotypic trait information gathered from individuals for each case and control sample. Genomic DNA 
was extracted from each of the blood samples for genetic analyses. 
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DNA Extraction from Blood Samples 

[0195] Six to ten milliliters of whole blood was transferred to a 50 ml tube containing 27 ml of red 
cell lysis solution (RCL). The tube was inverted until the contents were mixed. Each tube was incubated 
for 10 minutes at room temperature and inverted once during the incubation. The tubes were then 
centrifuged for 20 minutes at 3000 x g and the supernatant was carefully poured off. 100-200 ul of 
residual liquid was left in the tube and was pipetted repeatedly to resuspend the pellet in the residual 
supernatant. White cell lysis solution (WCL) was added to the tube and pipetted repeatedly until 
completely mixed. While no incubation was normally required, the solution was incubated at 37°C or 
room temperature if cell clumps were visible after mixing until the solution was homogeneous. 2 ml of 
protein precipitation was added to the cell lysate. The mixtures were vortexed vigorously at high speed 
for 20 sec to mix the protein precipitation solution uniformly with the cell lysate, and then centrifuged for 
10 minutes at 3000 x g. The supernatant containing the DNA was then poured into a clean 15 ml tube, 
which contained 7 ml of 100% isopropanol. The samples were mixed by inverting the tubes gently until 
white threads of DNA were visible. Samples were centrifuged for 3 minutes at 2000 x g and the DNA 
was visible as a small white pellet. The supernatant was decanted and 5 ml of 70% ethanol was added to 
each tube. Each tube was inverted several times to wash the DNA pellet, and then centrifuged for 1 
minute at 2000 x g. The ethanol was decanted and each tube was drained on clean absorbent paper. The 
DNA was dried in the tube by inversion for 1 0 minutes, and then 1 000 ul of 1 X TE was added. The size 
of each sample was estimated, and less TE buffer was added during the following DNA hydration step if 
the sample was smaller. The DNA was allowed to rehydrate overnight at room temperature, and DNA 
samples were stored at 2-8°C. 

[0196] DNA was quantified by placing samples on a hematology mixer for at least 1 hour. DNA 
was serially diluted (typically 1 :80, 1 : 1 60, 1 :320, and 1 :640 dilutions) so that it would be within the 
measurable range of standards. 125 ul of diluted DNA was transferred to a clear U-bottom microtitre 
plate, and 125 ul of IX TE buffer was transferred into each well using a multichannel pipette. The DNA 
and IX TE were mixed by repeated pipetting at least 1 5 times, and then the plates were sealed. 50 ul of 
diluted DNA was added to wells A5-H12 of a black flat bottom microtitre plate. Standards were inverted 
six times to mix them, and then 50 ul of IX TE buffer was pipetted into well Al, 1000 ng/ml of standard 
was pipetted into well A2, 500 ng/ml of standard was pipetted into well A3, and 250 ng/ml of standard 
was pipetted into well A4. PicoGreen (Molecular Probes, Eugene, Oregon) was thawed and freshly 
diluted 1 :200 according to the number of plates that were being measured. PicoGreen was vortexed and 
then 50ul was pipetted into all wells of the black plate with the diluted DNA. DNA and PicoGreen were 
mixed by pipetting repeatedly at least 10 times with the multichannel pipette. The plate was placed into a 
Fluoroskan Ascent Machine (microplate fluorometer produced by Labsystems) and the samples were 
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allowed to incubate for 3 minutes before the machine was run using filter pairs 485 nm excitation and 
538 nm emission wavelengths. Samples having measured DNA concentrations of greater than 450 ng/jal 
were re-measured for conformation. Samples having measured DNA concentrations of 20 ng/ul or less 
were re-measured for confirmation. 

Pooling Strategies - Discovery Cohort 

[0197] Samples were derived from the Nottingham knee OA family study (UK) where index cases 
were identified through a knee replacement registry. Siblings were approached and assessed with knee x- 
rays and assigned status as affected or unaffected. In all 1 ,1 57 individuals were available. In order to 
create same-sex pools of appropriate sizes, 335 unrelated female individuals with OA from the 
Nottingham OA sample were selected for the case pool. The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 18 to 76 years of age. 
The female case samples and female control samples are described further in Table 1 below. 

[0198] A select set of samples from each group were utilized to generate pools, and one pool was 
created for each group. Each individual sample in a pool was represented by an equal amount of 
genomic DNA. For example, where 25 ng of genomic DNA was utilized in each PCR reaction and there 
were 200 individuals in each pool, each individual would provide 125 pg of genomic DNA. Inclusion or 
exclusion of samples for a pool was based upon the following criteria: the sample was derived from an 
individual characterized as Caucasian; the sample was derived from an individual of British paternal and 
maternal descent; case samples were derived from individuals diagnosed with specific knee osteoarthritis 
(OA) and were recruited from an OA knee replacement clinic. Control samples were derived from 
individuals free of OA, family history of OA, and rheumatoid arthritis. Also, sufficient genomic DNA 
was extracted from each blood sample for all allelotyping and genotyping reactions performed during the 
study. Phenotype information from each individual was collected and included age of the individual, 
gender, family history of OA, general medical information (e.g., height, weight, thyroid disease, 
diabetes, psoriasis, hysterectomy), joint history (previous and current symptoms, joint-related operations, 
age at onset of symptoms, date of primary diagnosis, age of individual as of primary diagnosis and order 
of involvement), and knee-related findings (crepitus, restricted passive movement, bony 
swelling/deformity). Additional knee information included knee history, current symptoms, any major 
knee injury, menisectomy, knee replacement surgery, age of surgery, and treatment history (including 
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hormone replace therapy (HRT)). Samples that met these criteria were added to appropriate pools based 
on disease status. 

[0199J The selection process yielded the pools set forth in Table 1 , which were used in the studies 
that follow: 



TABLE 1 





Female case 


Female control 


Pool size 
(Number) 


335 


335 


Pool Criteria 

(ex: case/control) 


control 


case 


Mean Age 

(ex: years) 


57.21 


69.95 



Example 2 

Association of Polymorphic Variants with Osteoarthritis 

[0200] A whole-genome screen was performed to identify particular SNPs associated with 
occurrence of osteoarthritis. As described in Example 1 , two sets of samples were utilized, which 
included samples from female individuals having knee osteoarthritis (osteoarthritis cases), and samples 
from female individuals not having knee osteoarthritis (female controls). The initial screen of each pool 
was performed in an allelotyping study, in which certain samples in each group were pooled. By pooling 
DNA from each group, an allele frequency for each SNP in each group was calculated. These allele 
frequencies were then compared to one another. Particular SNPs were considered as being associated 
with osteoarthritis when allele frequency differences calculated between case and control pools were 
statistically significant. SNP disease association results obtained from the allelotyping study were then 
validated by genotyping each associated SNP across all samples from each pool. The results of the 
genotyping then were analyzed, allele frequencies for each group were calculated from the individual 
genotyping results, and a p-value was calculated to determine whether the case and control groups had 
statistically significant differences in allele frequencies for a particular SNP. When the genotyping 
results agreed with the original allelotyping results, the SNP disease association was considered validated 
at the genetic level. 

SNP Panel Used for Genetic Analyses 

[0201] A whole-genome SNP screen began with an initial screen of approximately 25,000 SNPs 
over each set of disease and control samples using a pooling approach. The pools studied in the screen 
are described in Example 1 . The SNPs analyzed in this study were part of a set of 25,488 SNPs 
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confirmed as being statistically polymorphic as each is characterized as having a minor allele frequency 
of greater than 10%. The SNPs in the set reside in genes or in close proximity to genes, and many reside 
in gene exons. Specifically, SNPs in the set are located in exons, introns, and within 5,000 base-pairs 
upstream of a transcription start site of a gene. In addition, SNPs were selected according to the 
following criteria: they are located in ESTs; they are located in Locuslink or Ensembl genes; and they are 
located in Genomatix promoter predictions. SNPs in the set were also selected on the basis of even 
spacing across the genome, as depicted in Table 2. 

[0202] A case-control study design using a whole genome association strategy involving 
approximately 28,000 single nucleotide polymorphisms (SNPs) was employed. Approximately 25,000 
SNPs were evenly spaced in gene-based regions of the human genome with a median inter-marker 
distance of about 40,000 base pairs. Additionally, approximately 3,000 SNPs causing amino acid 
substitutions in genes described in the literature as candidates for various diseases were used. The case- 
control study samples were of female Caucasian origin (British paternal and maternal descent) 670 
individuals were equally distributed in two groups: female controls and female cases. The whole genome 
association approach was first conducted on 2 DNA pools representing the 2 groups. Significant markers 
were confirmed by individual genotyping. 



TABLE 2 



General Statistics 


Spacing Statistics 


Total # of SNPs 


25,488 


Median 


37,058 bp 


#ofExonic SNPs 


>4,335 (17%) 


Minimum* 


1,000 bp 


# SNPs with refSNP ID 


20,776 (81%) 


Maximum* 


3,000,000 bp 


Gene Coverage 


> 10,000 


Mean 


122,412 bp 


Chromosome Coverage 


All 


Std Deviation 


373,325 bp 






^Excludes outliers 





Allelotvping and Genotyping Results 

[0203] The genetic studies summarized above and described in more detail below identified an 
allelic variant in the TM7SF3 gene that is associated with osteoarthritis. 

Assay for Verifying. Allelotvping. and Genotyping SNPs 

[0204] A MassARRAY™ system (Sequenom, Inc.) was utilized to perform SNP genotyping in a 
high-throughput fashion. This genotyping platform was complemented by a homogeneous, single-tube 
assay method (hME™ or homogeneous MassEXTEND™ (Sequenom, Inc.)) in which two genotyping 
primers anneal to and amplify a genomic target surrounding a polymorphic site of interest. A third 
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primer (the MassEXTEND™ primer), which is complementary to the amplified target up to but not 
including the polymorphism, was then enzymatically extended one or a few bases through the 
polymorphic site and then terminated. 

[0205] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension primers used for 
analyzing polymorphisms. The initial PCR amplification reaction was performed in a 5 ul total volume 
containing IX PCR buffer with 1 .5 mM MgCl 2 (Qiagen), 200 uM each of dATP, dGTP, dCTP, dTTP 
(Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar DNA polymerase (Qiagen), and 200 nM 
each of forward and reverse PCR primers specific for the polymorphic region of interest. 



TABLE 3: PCR Primers 



SNP 
Reference 


Forward PCR primer 


Reverse PCR primer 


rs1 484086 


ACGTTGGATGTGTCACTCTTCGGAAGTCTC 


ACGTTGGATGCATGTACAGGGCATTCACAG 



[0206] Samples were incubated at 95°C for 1 5 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 ul volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 ul) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0207] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 4, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 



TABLE 4: Extension Primers 



SNP 
Reference 


Extend Probe 


Termination 
Mix 


rs1 484086 


CTCTTCGGAAGTCTCTTTCTCA 


ACT 
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[0208] The MassEXTEND™ reaction was performed in a total volume of 9 ul, with the addition of 
IX ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0209] Following incubation, samples were desalted by adding 1 6 ul of water (total reaction volume 
was 25 pi), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0210] Minor allelic frequencies for the polymorphisms set forth in Table A were verified as being 
10% or greater using the extension assay described above in a group of samples isolated from 92 
individuals originating from the state of Utah in the United States, Venezuela and France (Coriell cell 
repositories). 

[0211] Genotyping results are shown for female pools in Table 5. In Table 5, "AF" refers to allelic 
frequency; and "F case" and "F control" refer to female case and female control groups, respectively. 



TABLE 5: Genotyping Results 



SNP Reference 


AF 
F case 


AF 
F control 


p-value 


rs1484086 


T = 0.821 
C = 0.179 


T = 0.753 
C = 0.247 


0.0027 



[0212] The single marker alleles set forth in Table A were considered validated, since the 
genotyping data agreed with the allelotyping data and each SNP significantly associated with 
osteoarthritis. Particularly significant associations with osteoarthritis are indicated by a calculated p- 
value of less than 0.05 for genotype results. 
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Example 3 

Association of Polymorphic Variants with Osteoarthritis in Replication Cohorts 

[0213] The single marker polymorphism set forth in Table A was genotyped again in two replication 
cohorts consisting of individuals selected for OA. 

Sample Selection and Pooling Strategies - Replication Sample 1 

[0214] A second case control sample (replication sample #1) was created by using 100 Caucasian 
female cases from Chingford, UK, and 1 48 unrelated female cases from the St. Thomas twin study. 
Cases were defined as having Kellgren-Lawrence (KL) scores of at least 2 in at least one knee x-ray. In 
addition, 199 male knee replacement cases from Nottingham were included. (For a cohort description, 
see the Nottingham description provided in Example 1). The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 18 to 76 years of age. 
The replication sample 1 cohort was used to replicate the initial results. Table 6 below summarizes the 
selected phenotype data collected from the case and control individuals. 



TABLE 6 



Phenotype 


Female cases (n=248): 
median (range)/ (n,%) 


Male cases (n=199): 

median (range)/ (n,%) 


Female controls (n=313): 

mean (range)/ (n,%) 


Age 


59 (39- 73) 


66 (45- 73) 


55 (50- 72) 


Height (cm) 


162(141- 178) 


175(152- 198) 


162(141- 176) 


Weight (kg) 


68 (51- 123) 


86 (62- 127) 


64 (40-111) 


Body mass index 
(kg/m 2 ) 


26(18-44) 


29 (21-41) 


24(18- 46) 


Kellgren- 
Lawrence* left 
knee 


0 (63, 26%), 1 (20, 8%), 2 
(105, 43%), 3(58, 23%), 4 
(1,0%) 


NA 


NA 


Kellgren- 
Lawrence* right 
knee 


0(43,7%), 1 (18, 7%), 2 
(127, 52%), 3(57, 23%), 4 
(1,0%) 


NA 


NA 


KL* >2 both 
knees 


No (145, 59%), Yes (101, 
41%) 


NA 


NA 


KL* >2 either 
knee 


No (0, 0%), Yes (248, 100%) 


NA 


NA 
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* 0: normal, 1: doubtful, 2: definite osteophyte (bony protuberance), 3: joint space narrowing (with or without 
osteophyte), 4: joint deformity 

Sample Selection and Pooling Strategies - Replication Sample 2 

[0215] A third case control sample (replication sample #2) was created by using individuals with 
symptoms of OA. These individuals were recruited and examined by reumatologists. Affected joints 
were x-rayed and a diagnosis of definite or probable OA was made according to American College of 
Reumatology criteria by a single reumatologist. Controls were recruited from volunteers without any 
symptoms from the muskuloskeletal system based on a normal joint exam performed by a reumatologist. 
Only cases with a diagnosis of definite OA were included in the study. Only individuals of Caucasian 
origin were included. The cases consisted of 228 individuals with definite knee OA, 106 individuals with 
definite hip OA, and 74 individuals with hip OA. 



TABLE 7 



Phenotype 


Case 


Control 


Age at Visit 


62.7 


52.5 


Sex (Female/Male) 


227/119 


174/101 


Knee OA Xray: No 

Unknown 
Yes 


35% (120) 


80% (16) 


1%(4) 


0% (0) 


64% (221 ) 


20% (4) 


Hip OA Xray: No 

Unknown 
Yes 


63% (215) 


80% (16) 


2% (7) 


0% (0) 


35% (121) 


20% (4) 



Assay for Verifying. Allelotvping, and Genotyping SNPs 

[0216] Genotyping of the replication cohorts described in Tables 6 and 7 was performed using the 
same methods used for the original genotyping, as described herein. A MassARRAY™ system 
(Sequenom, Inc.) was utilized to perform SNP genotyping in a high-throughput fashion. This genotyping 
platform was complemented by a homogeneous, single-tube assay method (hME™ or homogeneous 
MassEXTEND™ (Sequenom, Inc.)) in which two genotyping primers anneal to and amplify a genomic 
target surrounding a polymorphic site of interest. A third primer (the MassEXTEND™ primer), which is 
complementary to the amplified target up to but not including the polymorphism, was then enzymatically 
extended one or a few bases through the polymorphic site and then terminated. 

[0217] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 



62 



PATENT 
Docket 524593009100 



polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension probes used for 
analyzing (e.g., genotyping) polymorphisms in the replication cohorts. The initial PCR amplification 
reaction was performed in a 5 ul total volume containing 1 X PCR buffer with 1 .5 mM MgCl2 (Qiagen), 
200 uM each of dATP, dGTP, dCTP, dTTP (Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar 
DNA polymerase (Qiagen), and 200 nM each of forward and reverse PCR primers specific for the 
polymorphic region of interest. 

[0218] Samples were incubated at 95°C for 1 5 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 ul volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 ul) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0219] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 7, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 

[0220] The MassEXTEND™ reaction was performed in a total volume of 9 ul, with the addition of 
IX ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0221] Following incubation, samples were desalted by adding 1 6 ul of water (total reaction volume 
was 25 ul), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHTP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
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spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0222] Genotyping results for replication cohorts #1 and #2 are provided in Tables 8 and 9, 
respectively. 



TABLES 



rsID 


Replication #1 
(Mixed Male/Female cases and Female controls) 

AF OA Con AF OA Cas Delta P-value 


Meta-analysis 
Disc. + Rep #1 
P-value 


rs1 484086 


0.23 


0.20 


0.03 


0.287 


0.0077 



TABLE 9 



rsID 


Replication #2 (Newfoundland) 
(Male/Female cases and controls) 

AF OA Con AF OA Cas Delta P-value 


Meta-analysis 
Disc. + Rep #2 
Not Done 


rs1 484086 


0.23 


0.21 


0.026 


0.280 





[0223] To combine the evidence for association from multiple sample collections, a meta-analysis 
procedure was employed. The allele frequencies were compared between cases and controls within the 
discovery sample, as well as within the replication cohort #1 using the DerSimian-Laird approach 
(DerSimonian, R. and N. Laird. 1986. Meta-analysis in clinical trials. Control Clin Trials 7: 177-188.) 

[0224] The absence of a statistically significant association in one or more of the replication cohorts 
should not be interpreted as minimizing the value of the original finding. There are many reasons why a 
biologically derived association identified in a sample from one population would not replicate in a 
sample from another population. The most important reason is differences in population history. Due to 
bottlenecks and founder effects, there may be common disease predisposing alleles present in one 
population that are relatively rare in another, leading to a lack of association in the candidate region. 
Also, because common diseases such as arthritis-related disorders are the result of susceptibilities in 
many genes and many environmental risk factors, differences in population-specific genetic and 
environmental backgrounds could mask the effects of a biologically relevant allele. For these and other 
reasons, statistically strong results in the original, discovery sample that did not replicate in one or more 
of the replication samples may be further evaluated in additional replication cohorts and experimental 
systems. 
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Example 4 
TM7SF3 Region Proximal SNPs 

[0225J It has been discovered that SNP rs 1484086 in TM7SF3 is associated with occurrence of 
osteoarthritis in subjects. TM7SF3 is an orphan receptor and is a member of the superfamily of 7- 
transmembrane domain proteins, one of the largest superfamilies of cell surface proteins. Members of 
this family include receptors for a variety of ligands, such as peptides, hormones, and ions, and for 
external sensory stimuli, such as odorants and light. Many 7-transmembrane molecules are able to recruit 
small G proteins, suggesting that they can transduce external signals to the cytoplasm. 

[0226] Thirty-seven additional allelic variants proximal to rs 1484086 were identified and 
subsequently allelotyped in osteoarthritis case and control sample sets as described in Examples 1 and 2. 
The polymorphic variants are set forth in Table 10. The chromosome positions provided in column four 
of Table 10 are based on Genome "Build 34" of NCBI's GenBank. 



TABLE 10 



dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Allele 
Variants 


rs1 058701 


12 


230 


27004780 


a/c 


rs11613 


12 


231 


27004781 


c/q 


rs900743 


12 


5330 


27009880 


a/t 


rs2 129091 


12 


6334 


27010884 


a/c 


rs15556 


12 


11372 


27015922 


C/t 


rs1 053724 


12 


11456 


27016006 


Q/t 


rs9699 


12 


11501 


27016051 


a/q 


rs4856 


12 


13393 


27017943 


c/t 


rs3782314 


12 


16666 


27021216 


a/q 


rs2087736 


12 


17596 


27022146 


c/t 


rs2068372 


12 


19710 


27024260 


a/g 


rs2068371 


12 


19800 


27024350 


a/q 


rs1 184529 


12 


20297 


27024847 


a/q 


rs2101256 


12 


20967 


27025517 


Q/t 


rs1 552257 


12 


32514 


27037064 


c/t 


rs 1565585 


12 


33159 


27037709 


a/q 


rs4080800 


I" 12 


37600 


27042150 


a/q 


rs1 388658 


12 


41259 


27045809 


a/g 


rs3782316 


12 


41329 


27045879 


c/t 


rs1 484086 


12 


50060 


27054610 


c/t 


rs 1484087 


12 


53292 


27057842 


c/t 


rs2291549 


12 


53393 


27057943 


a/q 


rs1 565584 


12 


56417 


27060967 


c/t 


rs3071166 


12 


56435 


27060985 


-/tt 


rs1 907652 


12 


58847 


27063397 


a/t 


rs3759119 


12 


59595 


27064145 


c/t 


rs3759120 


12 


59661 


27064211 


q/t 
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dbSNP 

rs# 


Chromosome 


Position in SEQ 
rD NO: 1 


Chromosome 
Position 


Allele 
Variants 


rs3782317 


12 


60355 


27064905 


c/t 


rs3825166 


12 


60407 


27064957 


a/c 


rs1872191 


12 


62357 


27066907 


a/q 


rs8628 


12 


68230 


27072780 


fl« 


rs1872193 


12 


68516 


27073066 


a/q 


rs1 042833 


12 


69055 


27073605 


c/t 


rs2476 


12 


72603 


27077153 


c/q 


rs1 976206 


12 


73928 


27078478 


a/q 


rs1 388659 


12 


85897 


27090447 


c/t 


rs1 184528 


12 


91554 


27096104 


a/q 



Assay for Verifying and Allelotyping SNPs 

[0227] The methods used to verify and allelotype the 37 proximal SNPs of Table 10 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 1 1 and Table 12, respectively. 



TABLE 11 



dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs1 058701 


ACGTTGGATGCTTCTCTCCAGTCCATGTTG 


ACGTTGGATGGTAACAAGTCCGAAGGATTC 


rs11613 


ACGTTGGATGTCCAAGTGCCTTCTCTCCAG 


ACGTTGGATGTCCTTGATGCATCTCGACAC 


rs900743 


ACG I I GGA I GAACAGA I CCGGAAC I 1 I I I I 


ACGI 1 GGA 1 GGGC 1 CA 1 AGAACCC 1 1 1 1 1 1 


rs2 129091 


ACGTTGGATGTTAAAATTGGTATGGTCTCC 


ACGTTGGATGGTTTTGCACTAAGAAGAGAC 


rs15556 


ACGTTGGATGTGTAGGGACAAAGTTATATGG 
AA 


ACGTTGGATGTGTCTTTACAGTCTTCACTGGC 
A 


rs1 053724 


ACGTTGGATGCCATATAACTTTGTCCCTAC 


ACGTTGGATGAAAATACAGTGCAGGTGACC 


rs9699 


ACGTTGGATGTTTCAGTGTTACTGAAGTTAAT 
T 


ACGTTGGATGCAACATAGAAAATATAAAACAA 
TTT 


rs4856 


ACGTTGGATGTTGTAAACCAAAAGGTAATTTC 
TCA 


ACGTTGGATGGATGACCAAAAGGTTGGTGG 


rs3782314 


ACGTTGGATGAATGCTGCACAACTTAGGCC 


A C GTTG G ATG AG C AGTC AG CCTATTCTTG C 


rs2087736 


ACGTTGGATGCAGTGAGACTCTGTCTCTAC 


ACGTTGGATGTACATCAGCCTCCCAAGTAG 


rs2068372 


ACGTTGGATGCTAGAGAGGGTTGCTCTATG 


ACGTTGGATGCCAGCTTCAGCTTTCTGTAG 


rs2068371 


ACGTTGGATGATTGACAAGCCAAGCCTGTC 


ACGTTGGATGATAGAGCAACCCTCTCTAGG 


rs1 184529 


ACGTTGGATGTTCCTTTCCTGGTGGTTTGC 


ACGTTGGATGGGTGTCAGTCTTGTAATCAG 


rs2101256 


ACGI IGGAIGCIGGGI 1 1 1 1 1 IGGIGIGI IG 


ACGTTGGATGCTTGTACTGTATCTCAATTTC 


rs1 552257 


ACGTTGGATGCAATACCTTAGCTCACCTAC 


ACGTTGGATGCATTAGGGAAAATCTGGATC 


rs1 565585 


ACGTTGGATGGCTAGAATATACCACAGGTG 


ACGTTGGATGTCACTGAAAATGGATCAGCC 


rs4080800 


ACGTTGGATGGGGTTCAAGCAATTCTCCTG 


ACGTTGGATGAAAATTAGCCAGGTGTGGTG 


rs 1388658 


ACGTTGGATGGAAAAGAGAGAGGATGGGTG 


ACGTTGGATGCAGGAGCTTAAGAACCACTG 


rs3782316 


ACGTTGGATGGTCTTGCTGTAATGTCATTAG 


ACGTTGGATGCTTCTC 1 1 1 1 1 1 AACTAGATC 


rs1484086 


ACGTTGGATGTGTCACTCTTCGGAAGTCTC 


ACGTTGGATGCATGTACAGGGCATTCACAG 


rs1 484087 


ACG 1 1 GGA 1 GGCA 1 GGAA 1 1 1 1 1 ACACCCC 


ACGTTGGATGTTCCCTCTCAAACTCTGTAG 


rs2291549 


ACGTTGGATGACTAAAGCGCCTCTTTGCTG 


ACGTTGGATGCATTAAGCCGCTAGTTCCAC 


rs1 565584 


ACGTTGG ATGG AAAAG GC AAGTTATG ATG C 


ACGTTGGATGAGTATCCATTCCTCTGAGTC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs3071 1 66 


ACGTTGGATGG AAAAGGCAAGTTATG ATG C 


ACGTTGGATGAGTATCCATTOOTCTGAGTC 


rsl 907652 


ACGTT GGATGCCATA 1 1 1 CCAGTACCTAGC 


ACGTTGGATGGGTTGATAAATATGTGCCAG 


rs3759119 


ACGTTG G ATG CTAAACTGTTCTT AGTG C C G 


ACGTTGGATGGGGAA 1 1 1 CAAAGAGTGTGG 


rs3759120 


ACGTTGGATGCGGCACTAAGAACAG 1 1 1 AGA 


ACGTTGGATGGG 1 1 1 1 1 ATGACTGTAGCAAC 


rs3782317 


ACGTTGGATGGGAAGCTC TTGAAGCTGTAG 


AGGTTGGATGGCCATTGAGAAATCCTGAGC 


rs3825166 


ACGTTGGATGACCAGACTCTCAACTTAGCC 


ACGTTGGATGTGGCAGGTGGGTTATTCTTG 




AOu 1 1 otiA Ivj 1 1 1 A 1 tjl/WjAAuljOl/ lulUI 




rs8628 


ACGTTGGATGCAAGCCTTACCAACAATTACA 
GAA 


ACGTTGGATGGCTTATCAAGAGTGAAAATAGA 
AGA 


rs1872193 


ACGTTGGATGAATGCCACCTCTAAGAGGCA 


ACGTTGGATGCTCAAGCCAAAGGAGAAGAC 


rs1 042833 


ACGTTG G ATGTC ATCC AAC CCTTGGATCTC 


ACGTTG G ATG ATAC AG C C CC AGG AAATAG C 


rs2476 


ACGTTGGATGAGGAGCAGGTGACGTTAATG 


ACGTTGGATGATTGACTAGCCACCAGGAAG 


rs1 976206 


ACGTTGGATGAGCTGGGATTGGATTACAGG 


ACGTTGGATGATGGAGAAACCCCGTCTCTA 


rs1 388659 


ACGTTGGATGACCTCAGCCTTCCACATTTG 


AC GTTG G ATG AG C AG G AGG AAACTTTTG G G 


rs1 184528 


ACGTTGGATGTGAGAATCGCTTGAACCCAG 


ACGTTGGATGTTTGAATCAGAGTCTCCCTC 



TABLE 12 



dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs1 058701 


CCAGTCCATGTTGAGGTGC 


ACT 


rs11613 


CCAGTCCATGTTGAGGTG 


ACT 


rs900743 


ATCCGGAAC I I I I I I I I I GAAATT 


ACT 


rs2 129091 


ATTTAATTTGAAAAGTGCTTACCC 


ACT 


rs15556 


Gl ! ICS MAI I I ICIAIAC 


ACG 


rs1 053724 


C ATC AAG TTTC AGTGTT A 


CGT 


rs9699 


TACAGAATAAGCAGTAAA 


ACG 


rs4856 


C AAGTATTTG G AAATAAG 


ACT 


rs3782314 


ACAAATATATGAGAACTCCTCTTT 


ACT 


rs2087736 


ACTCTGTCTCTACTAAAAATAAAA 


ACT 


rs2068372 


TGGCTTCATGGCACCACTG 


ACT 


rs2068371 


AGCCAAGCCTGTCACTGGCCCT 


ACT 


rs1 184529 


TCCTGGTGGTTTGCCACTTA 


ACT 


rs2101256 


Gl I I I I I IGGIGIGI IGAIAIGIA 


ACT 


rs1 552257 


AGCTCACCTACGAAAATGAATAA 


ACG 


rs1 565585 


TGTCAGACTGCACTACAT 


ACT 


rs4080800 


ATTCTCCTGTCTCAGCCTCC 


ACG 


rs1 388658 


GAGAGGATGGGTGAAATAAGG 


ACT 


rs3782316 


TGTAATGTCATTAGGAAGAAACA 


ACG 


rs1 484086 


CTCTTCGGAAGTCTCTTTCTCA 


ACT 


rs1 484087 


TTTACACCCCCAAATCTAGAG 


ACT 


rs2291549 


CCTCTTTGCTGCCCAGTGG 


ACT 


rs1 565584 


TGATGCAATAAGTATATATAGTAC 


ACT 


rs3071166 


IAUIACGIGGCI I I I I I I I I I I I I 


CGT 


rs1 907652 


TCCAGTACCTAGCACCTAAC 


CGT 


rs3759119 


TTAGTGCCGGGATGAATAACT 


ACT 



67 



PATENT 
Docket 524593009100 



dbSNP 


Extend 


Term 


rsff 


Primer 


IVI1X 


rs3759120 


AAATTGCAACTGTGAGTATTAAAG 


ACT 


rs3782317 


AGAATAACCCACCTGCCAAAT 


ACT 


rs3825166 


ACTTAGCCTACATTTGAAAAGGG 


ACT 


rs1872191 


CTCAGTCCGCTCCCCACTT 


ACG 


rs8628 


TTAAACACTATGACACAT 


ACG 


rs1872193 


AGAGGCAGGACACTAGCC 


ACG 


rs1 042833 


CCCTTGGATCTCTTTGAG 


ACG 


rs2476 


GTGACGTTAATGGGACAGCT 


ACT 


rs1 976206 


TACAGGCGCCCACCACCA 


ACG 


rs1 388659 


CCACAI I IGGIAAGI I I IGACAI 


ACT 


rs1 184528 


AGAGGTTGCAGAGAGCCAAGATC 


ACT 



Genetic Analysis 

[0228] Allelotyping results from the discovery cohort are shown for cases and controls in Table 13. 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rs900743 has the following case and control allele frequencies: case Al (G) = 0.92; case A2 (T) 
= 0.08; control Al (G) = 0.90; and control A2 (T) = 0.10, where the nucleotide is provided in paranthesis. 
Some SNPs are labeled "untyped" because of failed assays. 



TABLE 13 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs1 058701 


230 


27004780 


A/C 








rs11613 


231 


27004781 


C/G 








rs900743 


5330 


27009880 


G/T 


0.08 


0.10 


0.381 


rs2 129091 


6334 


27010884 


A/C 


0.19 


0.20 


0.646 


rs 15556 


11372 


27015922 


err 


0.74 


0.74 


0.923 


rs 1053724 


11456 


27016006 


G/T 


0.14 


0.15 


0.592 


rs9699 


11501 


27016051 


A/G 


0.67 


0.68 


0.595 


rs4856 


13393 


27017943 


CfT 


0.10 


0.09 


0.474 


rs3782314 


16666 


27021216 


A/G 


0.20 


0.22 


0.278 


rs2087736 


17596 


27022146 


C/T 








rs2068372 


19710 


27024260 


A/G 


0.11 


0.10 


0.634 


rs2068371 


19800 


27024350 


A/G 








rs1 184529 


20297 


27024847 


A/G 








rs2101256 


20967 


27025517 


G/T 








rs1 552257 


32514 


27037064 


err 


0.67 


0.67 


0.969 


rs 1565585 


33159 


27037709 


A/G 


0.69 


0.65 


0.064 


rs4080800 


37600 


27042150 


A/G 








rs 1388658 


41259 


27045809 


A/G 


0.35 


0.38 


0.362 


rs3782316 


41329 


27045879 


err 


0.47 


0.46 


0.740 


rs1 484086 


50060 


27054610 


CfT 
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Hh^NP 
rs# 


Position in 
SEQ ID NO: 
1 


V-Ill U1I1U3U111C 

Position 


A1/A2 
Allele 


F A2 

Pocp \V 
V_ . .1 > L f\l 


F A2 
Control AF 


F D- 

Value 


rs1 484087 


53292 


27057842 


C/T 


0.35 


0.38 


0.127 


rs2291549 


53393 


27057943 


A/G 








rs1 565584 


56417 


27060967 


C/T 


0.66 


0.69 


0.327 


rs3071166 


56435 


27060985 


-m 


0.52 


0.47 


0.042 


rs1 907652 


58847 


27063397 


AJT 


0.64 


0.64 


0.984 


rs3759119 


59595 


27064145 


CfT 


0.08 


0.05 


0.042 


rs3759120 


59661 


27064211 


G/T 


0.15 


0.14 


0.861 


rs3782317 


60355 


27064905 


C/T 


0.08 


0.07 


0.780 


rs3825166 


60407 


27064957 


A/C 


0.83 


0.84 


0.660 


rs1872191 


62357 


27066907 


A/G 


0.04 


0.05 


0.555 


rs8628 


68230 


27072780 


G/T 


0.59 


0.61 


0.411 


rs 1872 193 


68516 


27073066 


A/G 


0.92 


0.91 


0.669 


rs1 042833 


69055 


27073605 


C/T 


0.85 


0.86 


0.605 


rs2476 


72603 


27077153 


C/G 


0.81 


0.82 


0.392 


rs 1976206 


73928 


27078478 


A/G 


0.16 


0.15 


0.781 


rs1 388659 


85897 


27090447 


C/T 


0.49 


0.50 


0.692 


rs1 184528 


91554 


27096104 


A/G 


0.20 


0.17 


0.228 



[0229] The TM7SF3 proximal SNPs were also allelotyped in the replication cohorts using the 
methods described herein and the primers provided in Tables 1 1 and 12. The replication allelotyping 
results for replication cohort #1 and replication cohort #2 are provided in Tables 14 and 15, respectively. 



TABLE 14 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs 1058701 


230 


27004780 


A/C 








rs11613 


231 


27004781 


C/G 








rs900743 


5330 


27009880 


G/T 


0.07 


0.11 


0.258 


rs2 129091 


6334 


27010884 


A/C 


0.18 


0.20 


0.450 


rs15556 


11372 


27015922 


C/T 


0.73 


0.74 


0.858 


rs1 053724 


11456 


27016006 


G/T 


0.15 


0.15 


0.969 


rs9699 


11501 


27016051 


A/G 


0.65 


0.68 


0.290 


rs4856 


13393 


27017943 


C/T 


0.11 


0.10 


0.735 


rs3782314 


16666 


27021216 


A/G 


0.18 


0.22 


0.171 


rs2087736 


17596 


27022146 


u C/T 








rs2068372 


19710 


27024260 


A/G 


0.11 


0.11 


0.867 


rs2068371 


19800 


27024350 


A/G 








rs1 184529 


20297 


27024847 


A/G 








rs2101256 


20967 


27025517 


G/T 








rs1 552257 


32514 


27037064 


C/T 


0.70 


0.66 


0.239 


rs1 565585 


33159 


27037709 


A/G 


0.67 


0.65 


0.599 


rs4080800 


37600 


27042150 


A/G 








rs1 388658 


41259 


27045809 


A/G 


0.36 


NA 


NA 


rs3782316 


41329 


27045879 


C/T 


0.45 


0.49 


0.230 


rs1484086 


50060 


27054610 


C/T 








rs1 484087 


53292 


27057842 


C/T 


0.32 


0.40 


0.023 


rs2291549 


53393 


27057943 


A/G 








rs1 565584 


56417 


27060967 


CfT 


0.61 


0.65 


0.297 


rs3071166 


56435 


27060985 


-fTT 


0.55 


0.45 


0.003 


rs1 907652 


58847 


27063397 


AJT 


0.62 


NA 


0.638 


rs3759119 


59595 


27064145 


CfT 


0.09 


0.04 


0.025 


rs3759120 


59661 


27064211 


G/T 


0.16 


0.15 


0.651 



69 



PATENT 
Docket 524593009100 





Position in 
SEQ ID NO: 
1 


Chromosome 

rualUUD 


A 1 /A? 

A 11 ala 

Allele 


IT A 7 
r >\i 


r /vz 
control At 


F n- 

Value 


rs3782317 


60355 


27064905 


C/T 


0.07 


0.07 


0.986 


rs3825166 


60407 


27064957 


A/C 


0.84 


0.84 


0.945 


rs1872191 


62357 


27066907 


A/G 


0.04 


0.06 


0.562 


rs8628 


68230 


27072780 


G/T 


0.58 


0.64 


0.060 


rs1872193 


68516 


27073066 


A/G 


0.92 


0.91 


0.957 


rs1 042833 


69055 


27073605 


c/t 


0.85 


0.86 


0.746 


rs2476 


72603 


27077153 


C/G 


0.81 


0.84 


0.397 


rs1 976206 


73928 


27078478 


A/G 


0.17 


0.16 


0.788 


rs1 388659 


85897 


27090447 


C/T 


0.47 


0.50 


0.283 


rs1 184528 


91554 


27096104 


A/G 


0.21 


0.16 


0.170 



TABLE 15 



Q Doll r 

■ art- 


Position in 
SEQ ID NO: 
1 


Chromosome 

» ualliuil 


/A I / /AX 

Allele 


F Al 

f * r\ r n A X." 
V ;l S I AT 


F A1 

I /AX 

t^oniroi Ar 


F n- 
1 P 

Value 


rs1 058701 


230 


I 27004780 


A/C 








rs11613 


231 


27004781 


C/G 








rs900743 


5330 


27009880 


L G/T 


0.10 


0.09 


0.763 


rs2 129091 


6334 


27010884 


A/C 


0.21 


0.20 


0.896 


rs 15556 


[ 11372 


27015922 


C/T 


0.75 


0.75 


0.974 


rs1 053724 


11456 


27016006 


G/T 


0.12 


0.15 


0.373 


rs9699 


11501 


27016051 


A/G 


0.69 


0.67 


0.615 


rs4856 


13393 


27017943 


C/T 


0.10 


0.08 


0.413 


rs3782314 


16666 


27021216 


A/G 


0.22 


0.22 


0.944 


rs2087736 


17596 


27022146 


C/T 








rs2068372 


19710 


27024260 


A/G 


0.11 


0.10 


0.561 


rs2068371 


19800 


27024350 


A/G 








rs1 184529 


20297 


27024847 


A/G 








rs2101256 


20967 


27025517 


G/T 








rs1 552257 


32514 


27037064 


C/T 


0.63 


0.68 


0.247 


rs1 565585 


33159 


27037709 


A/G 


0.73 


0.66 


0.029 


rs4080800 


37600 


27042150 


A/G 








rs 1388658 


41259 


27045809 


A/G 


0.33 


0.38 


0.217 


rs3782316 


41329 


27045879 


C/T 


0.51 


0.42 


0.036 


rs1 484086 


50060 


27054610 


C/T 








rs1 484087 


53292 


27057842 


C/T 


0.38 


0.36 


0.575 


rs2291549 


53393 


27057943 


A/G 








rs1 565584 


56417 


27060967 


C/T 


0.73 


0.74 


0.635 


rs3071166 


56435 


27060985 


-m 


0.48 


0.49 


0.727 


rs 1907652 


58847 


27063397 


ATT 


0.66 


-0.02 




rs3759119 


59595 


27064145 


C/T 


0.06 


0.05 


0.669 


rs3759120 


59661 


27064211 


G/T 


0.13 


0.14 


0.881 


rs3782317 


60355 


27064905 


C/T 


0.08 


0.07 


0.655 


rs3825166 


60407 


27064957 


A/C 


0.82 


0.84 


0.546 


rs1872191 


62357 


27066907 


A/G 


0.04 


0.04 


0.894 


rs8628 


68230 


27072780 


G/T 


0.61 


0.56 


0.234 


rs1872193 


68516 


27073066 


A/G 


0.93 


0.91 


0.528 


rs1 042833 


69055 


27073605 


C/T 


0.85 


0.86 


0.682 


rs2476 


72603 


27077153 


C/G 


0.80 


0.81 


0.829 


rs1 976206 


73928 


27078478 


A/G 


0.14 


0.14 


0.834 


rs1 388659 


85897 


27090447 


C/T 


0.51 


0.48 


0.499 


rs1 184528 


91554 


27096104 


A/G 


0.18 


0.18 


0.993 



[0230] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
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were plotted in Figure 1 for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis provides the negative logarithm (base 10) of the p-value comparing 
the estimated allele in the case group to that of the control group. The minor allele frequency of the 
control group for each SNP designated by an X or other symbol on the graphs in Figure 1 can be 
determined by consulting Table 13. For example, the left-most X on the left graph is at position 
27004780. By proceeding down the Table from top to bottom and across the graphs from left to right the 
allele frequency associated with each symbol shown can be determined. 

[0231] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with lkb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01 , to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p- values that were less than 10" 8 were truncated at that value. 

[0232] The exons and introns of the genes in the covered region are plotted below each graph at the 
appropriate chromosomal positions. The gene boundary is indicated by the broken horizontal line. The 
exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of each gene to show 
the direction of transcription. 

Example 5 

In Vitro Production of Target Polypeptides 

[0233] cDNA is cloned into a pIVEX 2.3-MCS vector (Roche Biochem) using a directional cloning 
method. A cDNA insert is prepared using PCR with forward and reverse primers having 5 ' restriction 
site tags (in frame) and 5-6 additional nucleotides in addition to 3' gene-specific portions, the latter of 
which is typically about twenty to about twenty-five base pairs in length. A Sal I restriction site is 
introduced by the forward primer and a Sma I restriction site is introduced by the reverse primer. The 
ends of PCR products are cut with the corresponding restriction enzymes (i.e., Sal I and Sma I) and the 
products are gel-purified. The pIVEX 2.3-MCS vector is linearized using the same restriction enzymes, 
and the fragment with the correct sized fragment is isolated by gel-purification. Purified PCR product is 
ligated into the linearized pIVEX 2.3-MCS vector and E. coli cells transformed for plasmid 
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amplification. The newly constructed expression vector is verified by restriction mapping and used for 
protein production. 

[0234] E. coli lysate is reconstituted with 0.25 ml of Reconstitution Buffer, the Reaction Mix is 
reconstituted with 0.8 ml of Reconstitution Buffer; the Feeding Mix is reconstituted with 10.5 ml of 
Reconstitution Buffer; and the Energy Mix is reconstituted with 0.6 ml of Reconstitution Buffer. 0.5 ml 
of the Energy Mix was added to the Feeding Mix to obtain the Feeding Solution. 0.75 ml of Reaction 
Mix, 50 ul of Energy Mix, and 10 ug of the template DNA is added to the E. coli lysate. 

[0235] Using the reaction device (Roche Biochem), 1 ml of the Reaction Solution is loaded into the 
reaction compartment. The reaction device is turned upside-down and 1 0 ml of the Feeding Solution is 
loaded into the feeding compartment. All lids are closed and the reaction device is loaded into the 
RTS500 instrument. The instrument is run at 30°C for 24 hours with a stir bar speed of 1 50 rpm. The 
pIVEX 2.3 MCS vector includes a nucleotide sequence that encodes six consecutive histidine amino 
acids on the C-terminal end of the target polypeptide for the purpose of protein purification. Target 
polypeptide is purified by contacting the contents of reaction device with resin modified with Ni 2+ ions. 
Target polypeptide is eluted from the resin with a solution containing free Ni 2+ ions. 

Example 6 

Cellular Production of Target Polypeptides 

[0236] Nucleic acids are cloned into DNA plasmids having phage recombination cites and target 
polypeptides are expressed therefrom in a variety of host cells. Alpha phage genomic DNA contains 
short sequences known as attP sites, and E. coli genomic DNA contains unique, short sequences known 
as attB sites. These regions share homology, allowing for integration of phage DNA into E. coli via 
directional, site-specific recombination using the phage protein Int and the E. coli protein IHF. 
Integration produces two new att sites, L and R, which flank the inserted prophage DNA. Phage excision 
from E. coli genomic DNA can also be accomplished using these two proteins with the addition of a 
second phage protein, Xis. DNA vectors have been produced where the integration/excision process is 
modified to allow for the directional integration or excision of a target DNA fragment into a backbone 

vector in a rapid in vitro reaction (Gateway™ Technology (Invitrogen, Inc.)). 

[0237] A first step is to transfer the nucleic acid insert into a shuttle vector that contains attL sites 
surrounding the negative selection gene, ccdB {e.g. pENTER vector, Invitrogen, Inc.). This transfer 
process is accomplished by digesting the nucleic acid from a DNA vector used for sequencing, and to 
ligate it into the multicloning site of the shuttle vector, which will place it between the two attL sites 
while removing the negative selection gene ccdB. A second method is to amplify the nucleic acid by the 
polymerase chain reaction (PCR) with primers containing attB sites. The amplified fragment then is 

72 



\ 



PATENT 
Docket 524593009100 

integrated into the shuttle vector using Int and IHF. A third method is to utilize a topoisomerase- 
mediated process, in which the nucleic acid is amplified via PCR using gene-specific primers with the 5 ' 

upstream primer containing an additional CACC sequence (e.g., TOPO® expression kit (Invitrogen, 
Inc.)). In conjunction with Topoisomerase I, the PCR amplified fragment can be cloned into the shuttle 
vector via the attL sites in the correct orientation. 

[0238] Once the nucleic acid is transferred into the shuttle vector, it can be cloned into an expression 
vector having attR sites. Several vectors containing attR sites for expression of target polypeptide as a 
native polypeptide, N-fusion polypeptide, and C-fusion polypeptides are commercially available (e.g., 
pDEST (Invitrogen, Inc.)), and any vector can be converted into an expression vector for receiving a 
nucleic acid from the shuttle vector by introducing an insert having an attR site flanked by an antibiotic 
resistant gene for selection using the standard methods described above. Transfer of the nucleic acid 
from the shuttle vector is accomplished by directional recombination using Int, IHF, and Xis (LR 
clonase). Then the desired sequence can be transferred to an expression vector by carrying out a 
one hour incubation at room temperature with Int, IHF, and Xis, a ten minute incubation at 37°C with 
proteinase K, transforming bacteria and allowing expression for one hour, and then plating on selective 
media. Generally, 90% cloning efficiency is achieved by this method. Examples of expression vectors 
are pDEST 14 bacterial expression vector with att7 promoter, pDEST 1 5 bacterial expression vector with 
a T7 promoter and a N-terminal GST tag, pDEST 1 7 bacterial vector with a T7 promoter and a N- 
terminal polyhistidine affinity tag, and pDEST 12.2 mammalian expression vector with a CMV promoter 
and neo resistance gene. These expression vectors or others like them are transformed or transfected into 
cells for expression of the target polypeptide or polypeptide variants. These expression vectors are often 
transfected, for example, into murine-transformed a adipocyte cell line 3T3-L1, (ATCC), human 
embryonic kidney cell line 293, and rat cardiomyocyte cell line H9C2. 

Nucleotide and Amino Acid Sequence Embodiments 

[0239] Table A includes information pertaining to the incident polymorphic variant associated with 
osteoarthritis identified herein. Public information pertaining to the polymorphism and the genomic 
sequence that includes the polymorphism are indicated. The genomic sequences identified in Table A 
may be accessed at the http address www.ncbi.nih.gov/entrez/query.fcgi, for example, by using the 
publicly available SNP reference number (e.g., rs 1484086). The chromosome position refers to the 
position of the SNP within NCBI's Genome Build 34, which may be accessed at the following http 
address: www.ncbi.nlm.nm.gov/mapview/map_search.cgi?chr=hum_chr.inf&query=. The "Contig 
Position" provided in Table A corresponds to a nucleotide position set forth in the contig sequence (see 
"Contig Accession No."), and designates the polymorphic site corresponding to the SNP reference 
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number. The sequence containing the polymorphisms also may be referenced by the "Nucleotide 
Accession No." set forth in Table A. The "Sequence Identification" corresponds to cDNA sequence that 
encodes associated target polypeptides (e.g., TM7SF3) of the invention. The position of the SNP within 
the cDNA sequence is provided in the "Sequence Position" column of Table A. If the SNP falls within 
an exon, the corresponding amino acid position (and amino acid change, if applicable) is provided as 
well. Also, the allelic variation at the polymorphic site and the allelic variant identified as associated 
with osteoarthritis is specified in Table A. All nucleotide and polypeptide sequences referenced and 
accessed by the parameters set forth in Table A are incorporated herein by reference. 



Table A 



RSJD 


Chrom- 
osome 


Chrom 
Position 


Contig 
Accession No. 
[1] 


Contig 
Position 


Nucleotide 
Accession 
No. [2] 


Sequence 
Position 


Amino 
Acid 
Position 


Locus 


Locus 
ID 


A [3] 


Allelic 
Variability 


OA 

Assoc. 
Allele 


rs1484086 


12 


27054610 


Hs12 9871 34:16 


19922317 


NM.016551 


intron 




TM7SF3 


51768 


R 


[TIC] 


T 



Til Contig Accession Number which can be found in the NCBI Database: 
http address: www.ncbi.nih.gov/entrez/query.fcgi 

f2 ) Sequence Identification or Nucleotide Accession Number which can be 

found in the NCBI Database: 

http address: www.ncbi.nih.gov/entrez/querv.fcgi 

[31 "A" column is the sequence orientation ("F" is forward. "R" is reverse). 

[0240] The following is a genomic nucleotide sequence for an TM7SF3 region. The following 
nucleotide representations are used throughout: "A" or "a" is adenosine, adenine, or adenylic acid; "C" or 
"c" is cytidine, cytosine, or cytidylic acid; "G" or "g" is guanosine, guanine, or guanylic acid; "T" or "t" is 
thymidine, thymine, or thymidylic acid; and "I" or "i" is inosine, hypoxanthine, or inosinic acid. Exons 
are indicated in italicized lower case type, introns are depicted in normal text lower case type, and 
polymorphic sites are depicted in bold upper case type. SNPs are designated by the following 
convention: "R" represents A or G, "M" represents A or C; "W" represents A or T; "Y" represents C or 
T; "S" represents C or G; "K" represents G or T; "V" represents A, C or G; "H" represents A, C, or T; 
"D" represents A, G, or T; "B" represents C, G, or T; and "N" represents A, G, C, or T. 

TM7SF3 Genomic Sequence (SEP ID NO: 1) 

>12: 27 004551-27096350 

1 acccaaatac taattttcat gttatttttg aacagatgcc tttttggaaa tcttaaaaaa 

61 gcaaacattt caaacagttg gctaacacgc atgaactaac actcattcag tttttaaatt 

121 acggatgata tccgtattaa cattataatg actctatact acagattgac atggtacatc 

181 gtaacaagtc cgaaggattc ttccttgatg catctcgaca catccttgaM Scacctcaac 

241 atggactgga gagaaggcac ttggaagcaa atcagaatgt acactaaata aacagtcaac 

301 ttttggggtg tggatggaag gggggtccat tttaaaagtg cttttacatt gaatttccct 
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361 cccagattag atcagcaaat aaatgaaatt tattaaaaga atgttgtgtc tttaatagcc 

421 atgctgttca gtgtagaaac tttaaatctc tcattaattt tctgtgtatt gtatcatgta 

481 cagcctgtta gatatttgca ggttttatag gttattcagt ccaacgcttt atttatttgc 

541 tcttgttagt tgttgctgtg aaacagtcat ttcctttctg gacatcaaag atattttctt 

601 ggcattaatt ttaaactgat ttcataaggt caaagtatta aatgttttaa ttcagattgt 

661 ggatagtttg tagtcctttg attttgcaaa tgaaattaaa atgcctttgt ttagatgtta 

721 ggattaagta tagtagtgct ctctccattg tgtcttaatt tttttttaac tcacttgtgg 

781 caggtcttta ttgaataaag gaggaataaa cctagacagg gagttagaaa tgaaagacat 

841 ttcaagacag ttcaagggtt atgttggtga cacttcttgt cttccagaca gttttgaaag 

901 cacagatctt gcctcttgtg cttgttaaaa aatctgtgat acttctctga agattttata 

961 attaaataag tgaaccatta ccatgacctt ttataaaagt gctgccctat ttgaagtatt 

1021 gggttttttc tcatgatgag ctttgttaga cttggtactg aaaaaatcag tagttttttt 

1081 ccttctattt taaaggcaaa gaaatgcttg cttcagctag gcatggtggc tcatgcctgt 

1141 aattgggagg ctgaggtggg tggatcacct gacgtcagga attggagacc agcctggcca 

1201 acatggtgaa accccatctc tactaaaaat acaaaaagtt agccaggcgt gatagcgggt 

1261 gcctgtaatc ccagctgctc gggaggctaa agcaagagaa ttgtttgaac ctgggagtgg 

1321 gaggttgcag tgagccgagg tcgtgccatt gcactccagc ctggccaaca agagcgagac 

1381 tctgtctaaa aaaaaagaaa tgcttgcttt ctgtgtagtt ttttaactta ggtcagtgcc 

1441 ataggactga ccaaatcata aattgaggat gctgttggta ttcaaagttt ttactcatta 

1501 cccttactca gtttgcagct gcaatgatct gtacttaacc tgcttttaca tattatgctt 

1561 ccttgacaat gggaggaatt ttaatggtta acttgcagaa tatgacagat acttgtctta 

1621 atatcattct tcctaccata atctaattca aaactatcca gtagaacttt ctgaaatgat 

1681 tggaatgttt tttatctgtg ctgtccagtg tggaagccag tagccacatg tggctgttga 

1741 gcacttgcag tgtcactaat gcgtttgagg gactgaattg ttaattttta aaaatgtatt 

1801 gattttaatt aagtttaaca ttgttttgga cagcacagat tagcctttta gagtaccttc 

1861 ttgctgagta ctagcagtgt ataggtttcc ccatgaaaag gacctacatt tttgttggtt 

1921 aagaggttat taaatgtgct ttgccaaaga tactaccttg tcatttcttc cagcttatta 

1981 gttccgtggg tttcttggag tactgttaaa tcctttttac aaatatatat atacacaact 

2041 actgaagaga ctgtttattt ggggagttat tcacttttat atttaagtct taggcttaat 

2101 atgataaaat acctctcagt tgctgtgaat aatggtctat agaagctgtt cattgagaaa 

2161 atgattattc tcgagtttct gttgatttta aacctttttt gtctgtctct ttcactagac 

2221 tgtaagcttc ttcagatcag gggacatgtc ttgacagtct ttgtattcct gttgtttagc 

2281 atagtactgg cacatcatag gtgtgcaaga taagaaaatg agattggaag aatttatgct 

2341 acttgctgcg cctaacaaat gtggatgtct gtattatact gatactatat tttcaatagt 

2401 tagcaaagaa ggccatttaa tattttaata ctgaaaagca gaattgtgct cattttattc 

2461 aatacttacc atataccata cactagatag taggtggatg taaagatgca gttcctgctc 

2521 tcaaagagga ttatgaccta gaaaaggtta cagattggat tgagataact ttataggcct 

2581 gaacaataca gaaaagattt ctaaatcatt gaagacttca gggaggaaaa gagattgaat 

2641 gcgtcggcaa tttttatcat tcccgttact ttgaacaata gtgtatgtaa gtgcaaaaaa 

2701 aaagaaaaag ttgctgatca ttgtggtgag aggaatgtaa aatatgagta tttttcttgc 

2761 tgctaggttt agttatattg tgctgcttat tcttaattct agtcctaaca aatttgattt 

2821 caaaagcaca tctgtggtca accaacttca ttgttaatat ttgttttatt catattaagt 

2881 gacaatgtaa gaatattcat gttaagtggt actgcagatt ttataaaaag gagaggatcc 

2941 aaaaataaaa aagtattttg ctgatttcca actcccatcc ctctttttca ggaactgcaa 

3001 gcacatgttg accagataac tgaaatggca gcagtaatga ggaaagccat tgaaattgac 

3061 gagcaacagg gttgcaagga acaagaacga atatttcaac ttgaagtaag ttttacattg 

3121 caaatgtaga tgaaaactgt ccatataaag atttatgcca gaaaatatat cccattttta 

3181 aatatatatt tactcttctt gggtactggt aatgttctcc tggatgctgt ctatgtgggc 

3241 atattatgca agaccattta tgtggtgatg caagaatcta ggattattag taacttacaa 

3301 tataccagta atgtagaaaa ttgatgtttc attctatatt gcttatctta tgaagtgaat 

3361 gtacttcttc aattgagtca tttttaaaat tttgtttatt ctattagaga tcatagtgat 

3421 acagggaaat tttttttaat ccacaaatat ttaggggcac tcttattttt ctggggattc 

34 81 taaaaacctt tgtcatctgt cctttatttt atgacagtga aagtaccagt ctctgaataa 

3541 ctgaaatcag tacaatagac ttttatgcct taagagcttc attttgaaat cacggcgggg 

3601 aacctgcttt agactactaa aatttagaaa gtgagactga agataaagtg atagagaaca 

3661 aattatgaaa ggttattgtt aaatgtctac aaattttttt acagtaacta accaaaagga 

3721 ggccagaaag caaagttatt taccagtgcc agatttggtc acttactttc cttcttgaaa 

3781 aatatcatca gctaccccta ccctgctgct ctcaaatgcc aatgagctaa acttgtccta 

3841 cattccatgc tatcatggaa agtaataaaa atactaaaaa ttactaggag attctagcct 

3901 attcactggc tttacgagtt tatataaaaa taaatctaga tgttattttc ttcatctcta 

3961 atgtgattgt aaaataaggc aatttctgag cttctgttta caattatagc tgcttaagta 

4021 agaaggcttt gcagtggtgg aagatgttga acacgattcc tagctgcata agtgttcttt 

4081 agattcaagt tgccaactgc atatatcatt gagtagatta atgccaatta cagaccttgc 

4141 ttgtttgtta taccttcata tgttcagttt ttcctaacat atgtttgtga tgttgtagct 
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4201 accttgtaga tagtgactgc atctttgttt ttgcttgcct tttagcaaga aaacaaaggc 

4261 ttgagagaga tccttcaaat aactcgagaa tcatttttga acctaaggaa agatgatgcg 

4321 tcggaaagta cttctttgtc agcattagtg accaacagtg atttgagtct gaggaagagc 

4381 tgaagagttt ctgagtctgt gagcttctta catggctcca aatggtcaaa taagtgaatg 

4441 aatgaatgga cagaaaattc aatcctttat ttttttctct gtaaatatgt acagtgcacg 

4501 ggctatgaga tagcaacaaa aaatgcatag ttaatggtca tagactttat tccaaaacat 

4561 aattggaaaa tagaaactga gccattgcca aatggtaaag aaatgaaaag ttttcacagt 

4621 gactactgaa tataccaaga gcttttggca gtactgctgg ctttctgggt gattaattag 

4 681 gtaaacttga atattcccag taaatgtttg agaatgcata aaattatacc aatttaaaaa 

4741 tataaatttt tgccactgtt ttgtgaacgg aggatttgta cgctaaattt catccttatt 

4801 tggcgtcaaa gttgatgcaa agtttatgta ccatgagact tctagtgatt gcctactttc 

4861 acatttattt aaatttttgt gaaagaaaat acagttttaa aggctatagg aatgtgttat 

4 921 atgtacctta aacagataat ttaattttag caatttatca taactgatac ttagctaaca 

4 981 attgtccata acttgtttaa gtgtaagaaa taccatttca ggaatgaaaa agaaggaata 

5041 ctactttcta agaaagaaga tcttataata gacatattta gtaggttaaa ctactccttt 

5101 gaaagaataa gttttggttc aaagtcaata atgaagacga gatttgtttt tctcttctct 

5161 ggctgatttt taaggatatt atgtagttca tttagttaac agaattgaaa tgtttgatat 

5221 ggcaagatag gtatggtaat ttcaaagtga attgggaatt cctctggctc atagaaccct 

5281 tttttttttc ctttaagtat tcttgagata caaaaaaaaa aagtaaataM aatttcaaaa 

5341 aaaaagttcc ggatctgttt ttaagctcca tctggtcctc ataacctgca agatttttct 

5401 taaaaccttt cagctgaaag tgggggtaaa ggtggagtaa tctgtggatt tgtttctgtt 

5461 gtcttttaaa atgtcaaata tataatatgt aattttttta aaaaccacca gatacagaaa 

5521 tgtgctttaa catcagttga aacctaaatt ttcttatgtt gtggtgattg tattaaaaag 

5581 ggataaaaga agagtgtcaa acatggttaa atatattgta ctcatttatg ttgaatacgt 

5641 attaaaatta agacaaatgg aaaattatac tttgagtata taatttgtta aatattactt 

5701 tatatggtaa ttttatgtat aatttcatat attggtaaaa ttcaaaacta cacttgagaa 

5761 tttttttatc ttaagtttgg ggtgaatggg gtggatgaga ctgattgaat agaaaagggc 

5821 taatggccca aacattatat agatttcttt ttttcagtca gaggccttat ttgatatttt 

5881 ataaataaat gacagttttt atttttaaac tttttattgt ttttgggaaa gtattcctta 

5941 atttaatgac acattcattc agatacttct tatccctgct aataaaggaa atctatttca 

6001 agctacacca ttgagattaa gtctgaggca gttcattgag gcagctctac tataaaagct 

6061 tacttgataa ataattattt ttgtaaacaa gttgggttaa cttattcttc gtctttttgc 

6121 ttggatatga atttaaggtc ttcatgttta aagacattta ctttgttatt tagtgacaca 

6181 tttccatcct attttttttt tttttttggt tgttgttaaa cagaacctta agtttatgtt 

6241 tgaggtatgt actgcatagg aacctatttt attattaaag atgaatgatt aaaattggta 

6301 tggtctccaa tttaatttga aaagtgctta cccKtattct tatatatggt ttaattttaa 

6361 ggttttttgt ctcttcttag tgcaaaacta cttagcagtg acctctatct gtattcctta 

6421 ggaattagca gcttcttagt gtggatcctg cagaacttct taccatttgt agtaggttga 

6481 atcatgtccc ctagaaggta agtctaagtc ctaacttgat acacctggga aggtgaccat 

6541 atttggaaat agtctttaca gatgtgatta ggggatctca ggatataatt accctggatt 

6601 ttgggtgggc tccaaaccca gtgactagtg tcgttacaat aagagaggag gagatatgaa 

6661 acacagatat agcagcatga ggacagaggc aatgagttac gcaaggactg ctagccacca 

6721 ccagaagctg caagagaggc atggaatggg ttcttcacgg gctccagaaa ggaccaagcc 

6781 tgcttgattt cagacttctg gcctccagaa ctgtgaggaa ataaatttct gttgttttca 

6841 ttcacccagc ctgtggtaat ttgttagagt aaccctagga aactaagaca tctattgttt 

6901 ttcatctatc aactatattt ttattttata taagaaagtt cttaaaaacc aatttataac 

6961 ctcacaattt ctgctctaat tttatattct ttaatatcaa ggtgacttct aaaattttat 

7021 agaaataaat tgcttccctg taatcccagc actttgggag gccgaaacag gtggatcagt 

7081 tgaggtcagg agttcaagac cagcctagcc aacatggtaa accctgtctc tactaaaaat 

7141 acaaaaagtt agccagggat ggtggtgggt gcctgtaatc tcagctactt gggaggccga 

7201 ggcaggagaa tcgcttgaac ctgggaagca gaggttgcag tgagccgaga tctcaccatt 

7261 gcactccagc ctgggcaata atagtgaaac tccatctcaa aaaaaaaaaa aaaattgctt 

7321 caatgaaaaa tggacaatcg tagatccata attttggtgg tcactacctt ttttcattgt 

7381 cagagtgaaa taggagaaca ttttttagta aaatactgat gaagatgata atttctaatt 

7441 ggaatttgag gagacacttt ttcctttaac ctctctggct gctcctcagt ctcctttgtt 

7501 tcttctcgct gacctcttaa cattagagag ccactgggct tagtccttat acctattctg 

7561 tgcagtgtct ctactcactg gtgatctcac ccagtctcat gactaaatac cacctctatg 

7621 cttgatggct cctaaattca cagttccagc atagacctct caccccggat tcatgcagct 

7681 gtgactgtca aagagaaatt ttatgcttaa gatgtccaaa gccaaactcc aaacctccca 

7741 gaacgcctcc atactctttc ccatcctgct tgatgacaac tccatccttg cagttgctaa 

7801 ggccaaaaac cttggtggca actttgactc cactttctca tatcgtgcat cctattcatc 

7861 agcaaatcct gttgttggct cttccttcca aataaatcag aatcctacca tttctctcct 

7921 ccactgctac ccctggtcaa agctacatta tggtagtgtt aaagtataca aattggattc 

7981 aacctaaata tcctacagta ggggaatagt taaatgtcaa tattcactca atacagttga 
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8041 

8101 

8161 

8221 

8281 

8341 

8401 

8461 

8521 

8581 

8641 

8701 

8761 

8821 

8881 

8941 

9001 

9061 

9121 

9181 

9241 

9301 

9361 

9421 

9481 

9541 

9601 

9661 

9721 

9781 

9841 

9901 

9961 

10021 

10081 

10141 

10201 

10261 

10321 

10381 

10441 

10501 

10561 

10621 

10681 

10741 

10801 

10861 

10921 

10981 

11041 

11101 

11161 

11221 

11281 

11341 

11401 

11461 

11521 

11581 

11641 

11701 

11761 

11821 



ttacagtcta 
aaaaagctta 
ggaaatgtcc 
gccagtcagt 
aatttattca 
cagtaaacat 
agtttgttag 
atcttagcat 
tccagaggtc 
gaggcaagtg 
aaaaaacgca 
tcattattta 
tgattacatt 
aactgaaaaa 
gttgattttg 
agtgtgagat 
acaattggca 
ctgttaataa 
cctccaagtt 
ctagccctat 
tatctgtctc 
agtgctcaag 
ctaagtcaaa 
ctttgtcgcc 
cgggttcaag 
gcccggctcg 
gtcttgaact 
caggcgtgag 
ccagacctgc 
ttcccactaa 
aatggccaaa 
cataacagaa 
gaggctgaag 
cctcttactg 
tcctaatctc 
tctcaattta 
gtaccgggtg 
gcaaaccata 
agggcaaggc 
ctaacaggta 
atatggatat 
cttcagataa 
agaacgatcg 
tagaaaatac 
gtcaggcatc 
ttaacttaag 
tgaaggttta 
agaaaactca 
acctataaca 
tcctgttaac 
agtcttttaa 
aatgaagaac 
gtcttgttgc 
ttttagcatt 
cacaaatgat 
tttacagtct 
actttgtccc 
gttaattata 
ttcaaagcaa 
gcaaaagtta 
aactttaaaa 
ttccacaacc 
ctgaaagctg 
tcttgtggtg 



tggcttgaag 
acataaatta 
tgaagcaaac 
ttctgatgcc 
tcagttcccc 
ccatatacat 
gtcatgtttt 
actaaaaaat 
attataagtt 
tcttaaatca 
agccacatga 
ggggagagag 
cttgtgaggc 
agggagtaaa 
tctcatcctt 
ttaacaaaac 
tgtgttggtt 
tactcccctt 
tcagctggaa 
cccattatac 
acactccagt 
gcaatgcctg 
gcctcactat 
taggctggag 
tgattcttcc 
ttatttttat 
cctgacttca 
ccaccacgcc 
caacctctcc 
cttaagcccc 
ttgccaagag 
taccactgac 
tccatgatca 
ctcttcacac 
cccttcttac 
atttgaaacc 
ttagggcttc 
ggtagctgcc 
cctgttctca 
ttaagtacat 
tggaaatgtt 
ttaacttgga 
tagcaatact 
ctggttttac 
ccagttttgg 
aatttcctca 
atattctagc 
actggcctgt 
gattaatcag 
tactactttt 
cacattgccc 
tatattgttt 
tgtggtacaa 
tttttcaaaa 
gaaaaatttt 
tcactggcag 
tacagtaaat 
gacagtaagg 
aaattgtttt 
aatatttttc 
agcagcatca 
ctaataccac 
gggggaaagc 
aaacaagggt 



atgctagcaa 
cccatataat 
agcatcagta 
tgtaaatcaa 
caactgatgg 
aactttattt 
gttatttttt 
ttacaattca 
tcttgtgaat 
gtagaggttt 
gcatctgtga 
tgaggaggga 
tctgattaac 
gactcagtta 
attctgtacc 
ttggttttaa 
ttacagctct 
cctacacatg 
tgtcacttcc 
attttcattg 
cccagacagc 
gccaggaact 
catatatata 
tgcaatagtg 
tcctcagcct 
tttttagtag 
agtgatctgc 
actcctgtat 
agtgtaatga 
tctttgtgtc 
gcatttcaat 
tgaatggctt 
aggtgcagca 
ggtctttccc 
aaagacactg 
tctttaaagg 
aacatataaa 
ctcccaacaa 
ggaaattaga 
cacctaggag 
aaatattatg 
aaaacatgct 
tatattagaa 
atctgtatca 
tctgagcact 
ggtattaaag 
aatttctttg 
ggtacaatat 
aactttctga 
ctttagtctg 
cagatttcat 
gcccatatta 
atataatctg 
tagggattta 
aaacaattag 
tgcattcaga 
aattttttaa 
attacagaat 
atattttcta 
cagttacctg 
tttaaatagt 
tcaggcctgc 
ccctgagtag 
gtcaaaactg 



caagaagaaa 
tacaaatatg 
ttcttccagc 
gcatttttgt 
atagtttgca 
attgtaattt 
aaatgaaaaa 
gagtaaacct 
cagaaaacct 
attgggccaa 
cctgagcttt 
aaagggagag 
gctcagtaaa 
ttcatctgtt 
tgggaagata 
gagttaaact 
ctacaatgag 
acatacttgg 
tcaaggactc 
catattacac 
aaagaccatg 
caacaaatat 
taatttttat 
cgatctcagt 
cccaagtagc 
agacggggtt 
ccaccttggc 
aacaattttt 
ttacatggta 
aatttcataa 
caacagtcag 
aaacaacaga 
gggatggttt 
ctgtgagcac 
gtcagactgg 
ccccgtctcc 
tttgtttggg 
gggctgggat 
accaagaact 
tgaacttcgg 
taccaatcat 
ttcactgtgt 
acatcctatc 
tgatgagtgt 
gttagttgat 
aggtcagtga 
ttggaatcag 
acctagtatg 
aaacatttta 
acacacctga 
ttcctgttat 
aataactttt 
aaaaatttta 
ttataaaaac 
tatcctccct 
tRgtatagaa 
aacttttcat 
aagcagtaaa 
tgttgtttta 
aaacttcaaa 
ggttaactca 
agagtcacca 
taagtatgca 
aggggactct 



aaagtaccta 
taaaaatcag 
taattgtact 
agctaccatt 
aagtttaata 
ctagcgtgca 
ataaatgtac 
cttcctacaa 
gttaaccaaa 
agtttaaaga 
ctaaagaggg 
gggttaggca 
tctatgtttc 
tcagggtagg 
agcttgttat 
taggttgcag 
aaggggaggc 
ttttggctac 
cttccttggc 
atcttgtgta 
cctgtgtttt 
ttaatgagtg 
ttttttgaga 
tcactgcagc 
tgacaggtat 
tcaccatgtt 
ctctcaaagt 
taaaaaatct 
gtccctgggt 
agagtggtcc 
ggtaatgtgt 
aataatttct 
cctctgcttg 
atgcacctgg 
attaggaccc 
aaataaggtc 
gggcacagtt 
gcaatgaggg 
ctgggtttga 
accaagtttc 
ttgaaataaa 
aaaattaata 
aattcgttag 
aaagcacatt 
aactctctga 
acatgacctc 
ttttctattt 
gtttggcaac 
aagaacagtt 
agtgctaata 
tatctttgtg 
atagctatga 
tgaaaatatt 
tatacaaact 
ccaaaatagt 
aataacaaac 
caagtttcag 
Yggtcacctg 
tcctgcccca 
aatctattgt 
taaacgtgta 
gctcacaata 
acccagggtg 
ccaaggtcct 



aatatgcatt 
atgcatacgg 
gtttgcatta 
atggagttgt 
caaaataatg 
aattcccaga 
actgttaaaa 
cctctgtctt 
aaaagtgact 
catgccagag 
ttttggaaac 
gtgaggcaaa 
atataagata 
cagaagagtg 
cagcattgtc 
acccaaggtt 
cttgaggctt 
tcctactcaa 
atcccacttt 
actattttag 
ccatcttcct 
aatgtgagat 
cagtgtcttg 
ctccgccccc 
gtgccatcat 
ggccaggctg 
gctgggatta 
taagcttgat 
aagtccagag 
ttgcaaatac 
attcttttga 
cacagttctg 
cagatgctcc 
tgtttctgtg 
accctaatag 
acattctgag 
tagtctatca 
gctgtgccct 
atgaatgtta 
tattgtggag 
tggtccatta 
attttcaaaa 
cagaagaaaa 
tggataggga 
cttcatatca 
tgaagtcctt 
cacctgctac 
ctggtagcta 
caacagagtt 
acaccaaatt 
attttttaaa 
aactcgtcat 
taatgaggct 
ttttatcact 
gtgttctgtc 
attccatata 
tgttaMtgaa 
cactgtattt 
aacacttaaa 
actttgaaaa 
tgcaacaaca 
attcagtgct 
tcgggtgctt 
aatgaaaagc 
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11881 
11941 
12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 
12481 
12541 
12601 
12661 
12721 
12781 
12841 
12901 
12961 
13021 
13081 
13141 
13201 
13261 
13321 
13381 
13441 
13501 
13561 
13621 
13681 
13741 
13801 
13861 
13921 
13981 
14041 
14101 
14161 
14221 
14281 
14341 
14401 
14461 
14521 
14581 
14641 
14701 
14761 
14821 
14881 
14941 
15001 
15061 
15121 
15181 
15241 
15301 
15361 
15421 
15481 
15541 
15601 
15661 



agctcctcta 
tttctgtaca 
agctaattga 
caatacagac 
agtggcgtga 
tcagcctccc 
tttttagtag 
tgttggccag 
aaatgctggg 
ggttatttcc 
acctgttgac 
aaatcaccct 
agctatttca 
gtacacttaa 
tttaaaaaat 
gggcatgcaa 
atttctcgag 
ctgtagcctc 
gagtaggcat 
tgtcatgttg 
tcccagtatt 
ctaaaattca 
actcaataat 
gggttataaa 
atctatttaa 
tttggaaata 
acaatcatcc 
atgcaaagaa 
ccactccagg 
aagcaaaggc 
taatcggcca 
tgtcactcgg 
tcgtcctctc 
tgccaggata 
aaaatctaac 
actttctggt 
gtagggaagc 
tccagacagg 
aagtacccac 
tagttacact 
ccattccact 
agattcaacc 
gtacttaaaa 
ctcttatcca 
acaacaatca 
ctccacatgt 
gtcatttttt 
gggaaactat 
aaggagggaa 
ccatccccac 
tgaccaggta 
tatcttcatc 
ggttctcaca 
ttagtttcca 
tggtttttgg 
tggaaatcct 
gaccagtaac 
atgttcagct 
ttataactct 
aggaccagac 
ttcaacatcc 
tcttccatat 
agccagttga 
ctcccactgt 



aacacctgaa 
aaggctttgt 
gcagaaagaa 
tttttttttt 
tcttggctca 
gagtagctgg 
agacagggtg 
gctggtctcg 
attacaggtt 
taaaaacaaa 
actatttgaa 
tataaaattc 
attaccattt 
acatgaataa 
actgaaaaaa 
ctaagtaatc 
acagggtctc 
aatcttcagg 
gtgccaccat 
cccatgtcgg 
gggattacag 
aaggcctaag 
aaactccctc 
aagaaacacc 
tggaataagt 
agRaaaagcc 
taatccccta 
cagatatata 
catgaactcc 
gttctctctc 
tagagcctct 
cgctctctct 
tctcttcgaa 
atgaagtcta 
ataataatac 
gagttagtcc 
cacagtctta 
ttgtaaaaaa 
agttgtttgt 
tcacatatgt 
ctacatgggg 
aatcgtactc 
cttgcaagag 
aagggaaagc 
caattgtgct 
caacatttct 
tcttcaaaaa 
atgcttacta 
acaagggaat 
ccataagtct 
aacctaaata 
ttaaaatttt 
gcattgtatg 
agtgctctgg 
atttaagagt 
tgttgagcgc 
tgtcaatggc 
gtggaagagt 
ccactggctt 
ccactcagag 
atatgagttc 
gaaggaactt 
aactaatgac 
taaaagcata 



tcctttttga 
acagaaatta 
ctatgaagta 
tttgagacgg 
ctgcaacctc 
gattacaggc 
tagtagtata 
aactcctgac 
tgagccaccg 
agtaatccat 
aaggccttga 
tattatctat 
taagcaataa 
gaaggtaaat 
aaacaaaggg 
agactgcttt 
tgtcacccaa 
gctccaggga 
gcctggctaa 
tctcaaactc 
gtgtgagcca 
tatcaaatcc 
cgaaagttta 
agtgctctgc 
tgatcataga 
tccctaccac 
gtacaccctt 
tgcctgatct 
aaagctgagg 
cagctggctg 
ctctcaatgg 
cttgcttcca 
tctgtaacgt 
aaaaacagta 
aacccttcat 
aaacacatat 
tttttgaaac 
atcagttgtt 
gtgttttttc 
acatcatgga 
ccctactgca 
agcttctagt 
ttttaaggtt 
tctggatagt 
tatgcaatat 
gaagtcaaca 
atttatagaa 
attaaaaact 
aacttaacat 
caaacatctt 
atcttggaaa 
aaatgcttta 
ctgtgttccc 
ttagcatagg 
cttaccatta 
tctcttgagt 
taaaaccacc 
ggaagagttt 
caatcacagg 
ctgaacagaa 
ttaaaacaca 
caacaataaa 
ccatcaggtt 
ttgactgtga 



ctccattctc 
ttaaaaacta 
caaatttttc 
agtctcgctc 
tgcctcctgg 
acctgccaca 
tattctacta 
cccaggtgat 
cacctggtcg 
atgtcatcta 
aaaaaaaaag 
ataaattggc 
aaggcaaaaa 
ttcatgtgta 
taacttttgg 
tagtaagaaa 
gctggagtgc 
tcctcccatc 
ttttttaatt 
ctgggctcaa 
ccatgtctgg 
ctaaatctcc 
gacaggctca 
agaagaatca 
tttgtaaacc 
caaccttttg 
accatatatc 
cttattagac 
cacactgacc 
ctccttctgg 
aggaatgtgg 
taacttgtat 
aattccactt 
gaaaaagtac 
ggactaaatc 
gtagttcctc 
taggaatttt 
tggatccttg 
agagggggtt 
acatcagatt 
ctcacagtgt 
aaagcttcta 
ttcaaggatt 
aaagggaaag 
atcttcaatt 
tatcttacaa 
aatctgtaat 
ctttcaacta 
taaatcaggt 
tactgtcagt 
tttcttaatg 
ctcccatgat 
tccttttgtc 
tgggtcaaga 
gtttgaaaag 
acgttcaaag 
gaataggagc 
aggcatcatc 
caatcacatc 
gttccctgga 
gttcagctta 
atctaattag 
ggtgtactag 
tgctccattt 



atcactttaa 
tattttacac 
taaggttttc 
tgttgcccag 
gttcaagcga 
atacccagct 
tactatagta 
ccacccgcct 
gcaatataga 
atgcagttaa 
ggggccatta 
ctattttcaa 
caaagggaac 
ttttatcaca 
ataatgtatg 
ataagttttt 
agtgacacaa 
tcagtctcct 
tttttgtaga 
gcgatactcc 
cttgcttctc 
aaatactgtc 
ggtgagagac 
gtttttaatt 
aaaaggtaat 
gtcatctttc 
aataagggca 
ttgcaccaga 
aagcccctgg 
aagagccctt 
tagctagggt 
gggtggggag 
acagccagca 
agtttgaact 
tgtatattcc 
agttacaaac 
ctattacagg 
gccttccaac 
cggggaggaa 
ccaccacgct 
aggtatagga 
gcttggcaaa 
tcaaggtctc 
atttgcccta 
ctaggacatg 
tctgaacaat 
ggatagaatc 
gcaaactgca 
tgtcattctc 
atttttctcc 
aaattcattt 
cctgagttat 
ccagttgact 
ctcctactgt 
gcacatttgt 
tgatgtagga 
caatgactcc 
catgttagaa 
tgctccctca 
aagataaact 
tccaaatact 
aggaagaatc 
agtaagcact 
ggtggaaaaa 



ggatgaagcc 
taatgctaac 
ttattaatgg 
ggtggagtgc 
ttctcctgcc 
aatttttgta 
gagaccacta 
cgacctccca 
ttaaggtggg 
attaaagatc 
tttggaatat 
ataataattt 
ctactgaatt 
acttaaaaat 
atactaagaa 
ttttttttta 
tcaaggctca 
tgggagctgg 
gatggggtct 
caccttggct 
tttttgtatt 
acagataaag 
ttgtttcaag 
tttttaatgt 
ttctcaagta 
tcattctctt 
ccataatatt 
gactgttgaa 
gcatctacag 
taatctgggt 
ccagaatgtt 
ggaagaacgg 
tgccccatac 
ctagtatttt 
tttctaatag 
caaggaatca 
catttgaaag 
attatgtaac 
ggcagtgatc 
catttctttc 
atgcatgaaa 
acactgtctg 
taccatcaac 
aaataaaatc 
ctttttcact 
tcatgtggta 
ttacaagtga 
tgcttcatta 
atcccttatt 
cctctgagaa 
agtagatttt 
cataacagag 
ttctttaaat 
gctgtcacac 
gaaagctctg 
aaggcttgtg 
acaagtcagt 
caaaaatagt 
ggcatgaaaa 
gaaaacaacc 
atggaaaata 
actcttctga 
accttctctg 
aaagaaaaaa 
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15721 
15781 
15841 
15901 
15961 
16021 
16081 
16141 
16201 
16261 
16321 
16381 
16441 
16501 
16561 
16621 
16681 
16741 
16801 
16861 
16921 
16981 
17041 
17101 
17161 
17221 
17281 
17341 
17401 
17461 
17521 
17581 
17641 
17701 
17761 
17821 
17881 
17941 
18001 
18061 
18121 
18181 
18241 
18301 
18361 
18421 
18481 
18541 
18601 
18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 



acaaacacca 
aatcagtcat 
gaaataagtg 
aacacttact 
gaaagtgacc 
aagaaactaa 
acacagatat 
gaattctaat 
aggtgggtgg 
agacactgga 
cctaaccctc 
cattgctgac 
gttaagccat 
atgtcctgtc 
aaccaaccaa 
cacaacttag 
aaagcaagaa 
tgtaaatggc 
cctaggctac 
attgaggtga 
atatgtggct 
catgcatgtg 
aaactagcaa 
aaaaatatgg 
tgggaggccg 
tggagaaacc 
ataatcccag 
ttgcagtgaa 
ctcaggaaaa 
tatataggta 
ggtgggagga 
ctactaaaaa 
gaggctgatg 
gtgtcactgc 
tgggtttgca 
tagttcataa 
agttatgtca 
gagacctctt 
atgatggaac 
cagagccaga 
catgaaaagt 
ttaacaaaag 
ttttgagaca 
actgcagcct 
gagaccacag 
atggggaatc 
ttggtctccc 
attttattta 
gaatctgtca 
tagaaaaaaa 
cattctctat 
aactttattc 
tttgagtaca 
aacaatggca 
aattactttt 
tgattaagta 
acaacagaag 
acctccttat 
atagatacaa 
gagggacttt 
agaactccac 
cagtattcaa 
agtaaagaaa 
gatcgagagg 



tgtggtaatc 
ctgatacagg 
atcagctgaa 
attcttaggc 
cagaatacac 
gagtaaacaa 
acatcaaagc 
tttcttcatt 
gaggaaccaa 
atagtccaag 
aaaatgacat 
ttcatgattt 
gaacccaatg 
atgccactgc 
ctatctgctc 
gccgacccag 
taggctgact 
cccatacctc 
aggactggac 
acttatagat 
ttcttgtaaa 
aagaacggac 
tctaatcaga 
gtaggcctct 
aggtgaacag 
ccatctctac 
ctactcagga 
ctgagactgc 
aaataatgtg 
ggccaggtgc 
ctgcctgagc 
taaaaRctta 
taagaggatg 
agtccagcat 
cccactagca 
acaaacatat 
aaaagtttag 
tatttctgtt 
aaagtttcaa 
atgcatatag 
acattaaatt 
aggggaagcc 
gggtcttgtt 
caacctcccg 
gtgtgcacta 
actatgttgc 
aaagtgctgg 
gcacatgtag 
tttcgccaag 
aagtcttcag 
agagttgaca 
tagtatttta 
gatgctgttt 
aagaaagtta 
taaaaacacc 
cctgagggac 
accagaaaca 
ttcctttacc 
agctacctct 
tactagtcct 
aggagaagca 
acagtcactc 
gtcactgacg 
attccaaatc 



ataaatagaa 
taacacagcc 
cacacaggat 
agcccatgaa 
catcatcatg 
cagctttacc 
cttaaagtaa 
taactgtcaa 
gcaggtcaac 
ctcttccatt 
tgaattatat 
catactgcaa 
taaacagtga 
acactgtgac 
ttcccaaacc 
aacaaatata 
gctgtttaat 
aagacctaca 
cttgagactc 
tctattctct 
aggctttcaa 
cctttcagag 
ttgtagtggt 
tggccgggca 
atcacttgag 
taaaaatata 
ggctgaggca 
gcgactgcac 
tgtgtgtgtg 
agtggctcaa 
ccagtttaag 
ccaggcatgg 
gcttgagcct 
gggtgacaga 
ttgaggagat 
ataaagggta 
agacaactgc 
ttctcattcc 
tacatcgtcc 
aaggtttttt 
actagaaatt 
cattaagtag 
cagtcgccca 
ggttcaagca 
caacacctgg 
ccaggctggt 
gattataggc 
ggtagagcct 
cctacacaac 
agagacaagg 
gcctcatgaa 
agaacattgt 
ttgtttgttt 
acttataagt 
aaaaaaaccc 
acctggggtt 
aagtaccagg 
tccattattt 
aaagacattc 
ggtgctgtct 
ctgacatgca 
gaacacacaa 
agatgaggaa 
gccaccacac 



cttcgttttc 
ctccccagtg 
aacataatca 
aactactgga 
aaaaatcttt 
aagttagagt 
caaccaggga 
tacaatcttc 
gctctgcact 
cactagctct 
ctccaagctc 
aataaaaaat 
gtccacttaa 
atgtgatttc 
cttctcctgc 
tgagaactcc 
actgacactg 
tatccaggtc 
catattctct 
agtttaacta 
gaaatctaag 
aataagctca 
cagccaaata 
tggtggctca 
gtcaggagtt 
aaaatgagct 
ggaaaatcac 
tctagcctgg 
tgtgtgtgtg 
gcctgtaatc 
accaccctag 
cagcacacac 
tagagtttga 
aagagatcct 
gatcactcta 
tatgctaaaa 
tctagactac 
tcacatctat 
aaggttataa 
cctgacttta 
ataacctatc 
tagttggttt 
ggctggagtg 
atcctctagc 
ctaatttttt 
ctcaaacttg 
atgagccacc 
tgaatctgta 
actgcttttc 
gacacatgac 
atggctattt 
cacagaaagg 
ttgacaataa 
taaccagatt 
cagaaccagc 
tggcagcgga 
ttccccgact 
ctaataaaca 
tatatttcat 
gaaagtgtca 
acaccactcc 
aacatctgtt 
ccccagcact 
agctaccaag 



acagtaacaa 
aataaaacat 
ataaatctaa 
atgaggatag 
aggtttcctg 
tggtttcttg 
atatgttggc 
tttgaaggat 
tattaacaaa 
gtgaacttga 
taaatcacct 
acaagtaaaa 
cagattttta 
ctcatgtatc 
tgctggactc 
tctttRtaaa 
gcaccaccat 
agagtggaag 
gaagccaaca 
tattaacctt 
tattgaaaat 
ttgcttccca 
ttttttaaat 
cgcctgtaat 
tgagaccagc 
gagcatggta 
ttgaactcgg 
gagacagagt 
tatgtgtgtg 
ccaacacttt 
gcaacagtga 
ctgtagtccc 
ggttgcagtg 
gtctctaaaa 
acggtccttg 
ttatttttac 
tttataaagt 
cctatgagat 
gctagaaagg 
taatttatgc 
atccctccat 
ttgtggtctt 
caatggcacc 
ctcagcctcc 
tttttttttt 
agctcaagca 
atgctcagcc 
tttcaaaaac 
atcagactat 
ttgtactcta 
ttatgtctaa 
aaacaaccaa 
tcaggaaatg 
ttaaaaactg 
cctgtgctac 
tgggagtgga 
cagctgccat 
cataccactt 
ctactgtgcc 
caccatcaat 
aggatgcaaa 
acatatcgcc 
agtccaacac 
aacattccac 



ggctatttag 
gcaaaatgat 
cttttgatga 
ctatgcaaga 
aaaaagcaaa 
gaacatctga 
aactagaaat 
gtgttctaga 
atcatttaga 
gcctcagttt 
gttagctcta 
ggcattcaaa 
ccgtcaaagc 
aggaaagagg 
taaaatgctg 
ttaagtttaa 
tcagtccatc 
actgggagtc 
gcaatatgcc 
cacaaaatac 
agttacgatg 
cataaggtga 
gctaatgctt 
cccaacactt 
ctggccaata 
gtgtgtgcct 
gaggtggagg 
gagactccgt 
tgtgtataca 
gggaggccga 
gactctgtct 
agctacttgg 
agctatgata 
aataaaaata 
agctatgaga 
caataggaat 
tgaaaattaa 
agtttcacag 
gaactagtga 
tcattctcct 
cctctaagca 
tttgttttgt 
atcatagctc 
cagagtagct 
ttcagtagag 
atcctcccac 
attaggaagt 
atccccatgt 
ccagtagtct 
ttatatcctg 
cagtcccatc 
atagaatact 
taagttatta 
atagcaattg 
tgccccaggg 
ggttgagggt 
cagagccttt 
atcacaacaa 
tttgaggaag 
gcagatttcc 
tccaattagg 
tacccagtgg 
agagcatgca 
cgacgcttcc 
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19561 
19621 
19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 



agtgacagct 
cggcagtact 
ctttctgtag 
ccctctctag 
agggccagtg 
cattctgtta 
attttgcttg 
aatcaatatg 
cagaaggtct 
acataagtaa 
aaacacagca 
tacttgatag 
cctatgaaga 
agctatggtt 
ttaagctcct 
aagaaattca 
ctacaggata 
tttcatagct 
acgtgtatgt 
tgtaaaaggt 
ttaattcatc 
gtaatgcaag 
ccatagctct 
atctcaattt 
ggcatggtgg 
tgaggctagg 
tgcactccag 
tagtgataac 
atgttgagat 
taatagcagg 
agatcaggat 
ggatttttta 
attcagtgag 
cattcccttt 
tgcagcccca 
atatttttac 
tcatccaggc 
tcaagcaatc 
agcctggcta 
gtctcaaact 
caggtgtgag 
ctcactatgt 
cttcccaaag 
ctaatctcgg 
attgtattcc 
taaaaatgca 
agatagtaga 
aagactgcaa 
gtccaaagaa 
acactcttcc 
attaatattt 
catacatgta 
tatgtagaca 
tgcttgctga 
aacagagaga 
gacaaaaggt 
ttaaaagaga 
gattcaaaca 
tttgatgcta 
gtggttctat 
catgatgacc 
gaggcagaga 
taaggctgga 
ccgaaactca 



gtcagaatca 
tccaaccgcg 
gtcaggcact 
ggaggggtct 
acaggcttgg 
aacataatgg 
ttgtacatct 
gcccctcctt 
gaatgagtag 
taggctgtca 
cctgccttct 
gtgtcagtct 
ataattYtaa 
aagcaccctt 
ggaatttaca 
gcagccatgg 
cactttcttg 
cattttaata 
gatacgaaat 
aaaagaaacc 
ctatcctgtt 
tatacagata 
acctattttc 
caataaatat 
cttgtgcctg 
aatgtgaggc 
cctaggagac 
tttgagacca 
attatatatt 
aaaggagttg 
tacatttgag 
ataatcgaaa 
catgatttac 
ggcatctgaa 
tagcctgatt 
aaagacaacc 
tggagtgcag 
ctcccacctc 
attttttgta 
cctgagctca 
ccaccacaca 
tgcctagact 
tgctgggatt 
ttatctttag 
cctgttaata 
ttgcaactca 
gataagtaag 
aatggaaata 
acaaatgaag 
taaaaaataa 
atagaacgaa 
tattacaaat 
atcaatatca 
tagcggtatg 
gagtgttcct 
ggcagggagt 
tttaaaagac 
aactgtggaa 
ttaaagaaat 
tttttaagtg 
agggatttgc 
agcaggtaca 
taattggtgt 
caatttaaaa 



gattcactgt 
tgccttagac 
gagggctctY 
gggctgagac 
cttgtcaata 
ggtagggagg 
catatcttca 
tccctaagta 
ttcagaaaag 
cacacaatat 
tgaataccca 
tgtaatcagt 
gtggcaaacc 
tttacagtgc 
tgttacagca 
gagtaattat 
tatataggtc 
tatttatgtt 
tacacattaa 
caaactgtcc 
tagaatgatc 
actttgttcc 
ttaaattctt 
aaaatgMtac 
tagtctcagc 
tgtagtgcac 
acagtgagac 
cagcttagca 
ctttacaata 
ctatttcaaa 
ctaccttgat 
acatcactta 
atcccagttc 
tctctcagga 
catgtgaaac 
aacttctttt 
tggcacaatc 
agcctcccca 
ttttttggag 
agcagtctgc 
cagccccaat 
tgtttcaaac 
acaggcatga 
tcaactaaca 
agagaaggtt 
gtagactcac 
tgcttactct 
gcaactacat 
aaaccaagca 
tgaaaattta 
ccttcatcaa 
ttatctttaa 
acaactgctg 
gaacacttcc 
aaaacaaata 

gggggggagg 
acatcaacta 
aacaattgag 
attttcaata 
cttatcttta 
tttaaaataa 
gcacagctga 
atggaggctc 
acatgtatta 



acaaagagag 
atacaatacc 
cagtggtgcc 
aaggaaatta 
aattgggatt 
tatctgtttt 
acatatcaca 
ccaaggaatt 
gtgttcagag 
aggccaaact 
gttaaactat 
atataaaaga 
accaggaaag 
aatttaaata 
ttcagaagaa 
tattaaacag 
agtttccatt 
ttctgcttaa 
attttaatag 
caaagcaaaa 
aatcattacc 
cacaaataag 
cacagctgtt 
atatcaacac 
tacttgggag 
tataattgta 
tccatctctt 
actaagttct 
ctctcttccc 
caatttaaca 
gctcagttat 
tataaaaagc 
ttcagtgtgt 
ctgagcagac 
caagtcaatc 
ttttttttct 
ttggctcact 
tgtagctggg 
aaacagggtc 
ctgcctcagc 
ttgtctttaa 
tcctggcctc 
gccactgagc 
gatctagcag 
atactccgct 
gactacaaga 
cctcacaggt 
tggacataat 
gggcaaaaag 
gtggataata 
aaaagctaag 
aatactcttg 
ctataacaaa 
tatgacatat 
atcctgtttc 
tggtgggggt 
aatactgtgt 
atgactccag 
ttttaaaatg 
aggtatacat 
tccagtgata 
aacaagataa 
gctgaagttt 
cacttataag 



aggatgaact 
gttaccagtg 
atgaagccag 
gctagggagc 
gagaaggaaa 
caggtaaaga 
tagccagaac 
caccatacag 
actagatgga 
gtagaaggaa 
ataattttca 
agaatcccat 
gaaaaaagca 
tttaatctac 
aaaaaaaatc 
atacaatatt 
gttaaattgt 
aataagaata 
ttttgtaagc 
tataactttt 
ttcatccatt 
cctaatcctg 
cttggtaata 
accaaaaaaa 
gctgaggtgg 
cctgtgaaca 
taaaaagaaa 
ttgttacaga 
aattaataaa 
taagtttaaa 
acaacttgtt 
agcccaaata 
ttcactgatg 
atttcaaact 
gccatcacca 
tgagacagga 
gcaacctctg 
actacaggca 
tcaccgagtt 
ctcccaaagt 
ttttttggta 
acacagtcct 
ccggccctga 
ctgataaaat 
gtagggagaa 
ccccagctac 
cattccccag 
acctgttttc 
agtgaagaac 
catccaattt 
tgaactgaca 
ccacccctaa 
aagagaatcc 
tcctgggtaa 
ttcaacaaat 
gggtggaggg 
atagatctca 
aagcaaactc 
aaaattatgt 
actgaaatat 
aaggagggtg 
tggccataaa 
ttcacaataa 
ttatttatta 



gctggataac 
ccagcttcag 
catagagcaa 
taaacctcaY 
agttgacaag 
aaacctcaac 
acaggtctat 
attaaaaacc 
ccattatttt 
aggtgaagta 
cttaccatca 
gatgataaag 
aaacaacaaa 
aaatacatgc 
taaaaggcag 
aaagtaccaa 
tatttgttca 
gcactttgtc 
agaaataaaa 
atatcttctt 
aaaacacctg 
atcatggagg 
cttgtactgt 
cccagagtga 
gaggatcact 
gccactacac 
aaaaaaaagc 
atcattctac 
aaaagtgtca 
gagctttatt 
gattaaataa 
attcaaacca 
ttgatgctgg 
cactgtacac 
atatccacac 
tcttactctg 
cctcccgggc 
cacaccatca 
acccaggttg 
gctgggatta 
gagaagggat 
ctcacctcag 
caaccaatgt 
tcactgcata 
tagaaaaaaa 
aaatctatga 
caacactgaa 
cagaatctgt 
actttggaag 
gatactttgt 
taatttatcc 
tgaatgaacc 
agacaatata 
agtggagaga 
aaactgcatg 
agaatacagg 
tctgcaacct 
tggctgttaa 
taatgacacg 
ttgtgagtaa 
atgaggtgga 
tcaaaattgt 
aaaatttaaa 
taatatttat 
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23401 
23461 
23521 
23581 
23641 
23701 
23761 
23821 
23881 
23941 
24001 
24061 
24121 
24181 
24241 
24301 
24361 
24421 
24481 
24541 
24601 
24661 
24721 
24781 
24841 
24901 
24961 
25021 
25081 
25141 
25201 
25261 
25321 
25381 
25441 
25501 
25561 
25621 
25681 
25741 
25801 
25861 
25921 
25981 
26041 
26101 
26161 
26221 
26281 
26341 
26401 
26461 
26521 
26581 
26641 
26701 
26761 
26821 
26881 
26941 
27001 
27061 
27121 
27181 



tacacataca 
ttgcattaca 
tcactaaatt 
acccaaaaca 
aaatacataa 
taagaaacac 
tctgagaata 
tataatccca 
atgagcctgg 
tggtggcaca 
cccaggaagc 
cgcagccaga 
attaaacgaa 
tagaatttaa 
atcccgtgaa 
aaactaagac 
caataccagg 
tccatcaaaa 
atgaaaggag 
gacagtattg 
tatagtctcc 
agagtaacaa 
gatcagagct 
ttagaagggg 
tcagtccatc 
ctttgggagg 
atatggtgaa 
cctgtaggac 
acctggccaa 
ggtacacgcc 
agaggcagtg 
gtgagactgt 
atatatatac 
tcttgagtgg 
ccaaatttct 
ccgcccaggc 
tcacaccatt 
caccatgccc 
tgttagccag 
gtgctgggat 
tgaaccccct 
ctgataagtc 
atcgagttct 
cttcagtaag 
ctcttagaaa 
gcccatgttg 
agtggtccct 
gctttaacta 
gtagagaacc 
ctccttgaaa 
ttgaatatgc 
ccaatggcct 
atcagcagca 
cttcaacata 
gatcttcaat 
cccttttctc 
ctccccttta 
agatttccaa 
tcaaccatca 
ccaaaactga 
agcgaatggt 
ctctttctct 
cactcatgag 
atccagagaa 



caaattacta 
accaatgatg 
catcacccac 
agacatctat 
aatgcctaag 
agacagaaga 
aggagcccat 
gcactttggg 
ccaacatggt 
tgcctgtagt 
agaggttgca 
ctccatctca 
agataaaaac 
tttttacacc 
cttttcagca 
acagttagat 
aaggagggag 
gcagaatcta 
tgaatagact 
ctaagggtta 
ctactccaag 
tagaactaac 
ctctgtctcc 
cttttgcatg 
ggttaaaaat 
ctgaggcggg 
accccatctc 
tttgggaggc 
catggttggt 
tataatccca 
gttgcagtga 
caaaaaaaaa 
acacacacaa 
aagattgtct 
ttttcgtttt 
tagagtgcag 
ctcctgcctc 
ggctaatgtt 
gatggtctcg 
tacaggcgtg 
cctaggccca 
agcttagcaa 
tcatcctcca 
aatcccctta 
ttttccatcc 
tactcaagag 
atgcctattg 
gtgtcactga 
ataatctctc 
gagctacttg 
tccagtcagg 
ccatgttgct 
tctgccaaag 
catcactttc 
ttcctaagct 
taagtcacac 
tagataacac 
gatgctctcc 
gttaaacatc 
actcttaatg 
cgctctctct 
caaacccctc 
ccaaatctgg 
taaaatagga 



ccagtcttat 
aagcacggta 
catcatgaga 
acaatctcat 
agtgctcaga 
gcagaattca 
aagaaagatt 
aggccgaggt 
gagaaccccg 
cccagctact 
gtgagccaag 
aaaacaaaaa 
accacattgt 
aaaaccatta 
agtataatca 
actcttccat 
agaaacaatt 
cgataaaaca 
aggacaggag 
tgaaccacag 
ctcaggaaaa 
atcactagga 
taagcctaac 
aagggttcaa 
atatggccag 
tggatcactt 
aactaaaaat 
cgaggcgggc 
caggtctcta 
gatactctgg 
gctgagatca 
aaaaaaaaaa 
acatatctgc 
tgagtggaag 
tttttttttt 
tggcgcgatc 
tgcctcagcc 
ttgtattgtt 
atctcctgac 
agccaccgca 
tctgcgcaca 
gaatccccta 
ccatccatcc 
ggtcagtgca 
actgactccc 
ttgggcccaa 
taatactacc 
tttttttttt 
gcttgaacac 
aactttctcc 
cgcctacccc 
aaatccaagg 
ttgattactc 
cttttaccaa 
ctaaacattg 
tgagtgtgcc 
catttagcat 
agctaagact 
tgcacataaa 
tatcctccca 
ctccagttgc 
atccaatcca 
aacaatttga 
atctacaggt 



caattataca 
tcatttagtt 
taaacccaag 
gctaaaataa 
atggagaaac 
caggggatat 
aaacaacagc 
gggcggatca 
tctactaaaa 
ccagaggctg 
gatcaggcca 
acgaaaaaca 
ttatttcaga 
aatttgctgg 
ccaagacagc 
aacattagag 
atacaagtaa 
agacataaaa 
caagttgaag 
ggcaaggttt 
gcactctaat 
aggttcaact 
agagagagac 
gggaaagaaa 
gcgcagtggc 
gaggccagga 
acaaaaatta 
atatcacctg 
ctaaaaatac 
aggctgaggt 
tgccactgca 
aaaaaaaaaa 
caaggtctct 
actgagtggt 
tttttttttt 
ttggctcact 
tccctagtag 
ttttttttta 
ctcgtgatcc 
cccggcctgc 
tctttgtaaa 
tgcttgatat 
cccaggtgat 
gacagaatgc 
accctactgc 
tctctcttcc 
ccccaccttg 
ctttaacagc 
aactaataga 
agtttctgtg 
acctctccac 
gtcattttca 
tctcttccat 
tctggctgca 
gagcacccag 
tgcctctccc 
gaggccttaa 
ttcctctgaa 
tgctcaatag 
accttctcct 
tcaggccaaa 
tcagtacact 
acatgaaaat 
ccaggtttaa 



tattgccttg 
tactgaagag 
aaaataatcc 
aacaaggctg 
catgaagaca 
aatacctaaa 
cgggtgcagt 
cttgaggtca 
atacaaaaat 
aggcaggaga 
ttgcactcca 
aaaaaaaaaa 
cctgtggttg 
gatgattttc 
aaggtcttcc 
gggaactaaa 
aatgagaggg 
caggcattca 
gttaaagtgt 
tagttaggag 
actcatcaat 
ttgggcggga 
aggcctgggg 
tggaggaacc 
tcacgcctgt 
gttcaagacc 
gtcaggcaag 
aggtcaggag 
aaaaattagc 
acgtgaatta 
ctccagcctg 
aaatatatat 
tgcattgtat 
gacaaagagc 
tgagacagag 
gccagctccg 
ctgggactac 
gtagagacgg 
gcccgccttg 
ttggccaaat 
atccagcttc 
caaatcaccc 
gtctgaccac 
ccccttgccc 
ttggctacaa 
ccactgcaaa 
aataaaatct 
gggatacggc 
ctttcttctt 
ctcccagctc 
tgcaacagct 
g tctctttct 
gcaatactgt 
gcttctcagc 
ggctcagtcc 
acctcccctt 
atgccatata 
ttccaggctt 
ggttctcaaa 
ccaacagtgc 
aactacaaag 
gtactccaaa 
aaatgatagt 
aataaataca 



gagaaaaatg 
aaaaatatgc 
ctgaactaaa 
ccacaaatca 
tacagtaaaa 
tatcagcttt 
ggctcacgcc 
ggagtttgag 
tagccaggcc 
attgcttgaa 
gcctggatgt 
acccagaaag 
actatcattc 
tcaaatgaaa 
aacccaataa 
tcccctatca 
tatggaaggt 
ctctggggga 
ctttaactat 
gcaatatgga 
cttgaaaatg 
agaacaactg 
gcctttctat 
cagacatact 
aatcccagca 
agcctggcca 
gtggcgcaca 
ttcaagacca 
agggtgtggt 
cttgaacctg 
ggtgacaaga 
atatatatat 
ccctcaagga 
ctttgcttgg 
tctcgctctg 
cctcccgggt 
aggcgcccac 
ggtttcactg 
gcctcccaaa 
ttcagtctcc 
agcaagaact 
tcgatatcta 
cctggtctgt 
ctcatgtttc 
attcccagta 
attccatcac 
tcctcactgt 
tctagcagag 
tagagcaaag 
ctatttgttc 
ctcactggta 
tagctgacct 
ctccccttgg 
ccattggtag 
tcagacccct 
ccacctctgc 
ttcagtctga 
atatatataa 
ctgaacttgt 
ttcccacctc 
tcctttacac 
catatccagc 
aaaggatttc 
tggataaata 
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27241 
27301 
27361 
27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 
30481 
30541 
30601 
30661 
30721 
30781 
30841 
30901 
30961 
31021 



aatggcaggg 
agtgatacat 
atcactggat 
aagtacctcc 
aagacaccaa 
gagacaagtg 
aatcaaaatt 
aataaatatt 
aaaaaaagac 
gggatcctaa 
ctcactctgt 
cttcctgggc 
gtgccaccgc 
ccaggctggt 
tgggattaca 
gacactattg 
tgttagcttc 
atttaatggc 
attgtgtgtg 
cggcccctga 
agtaaaagtc 
tgagacagca 
tgaagatgaa 
cttccttatg 
tagtctatga 
aggcttctct 
acgggtgggg 
ggatcacttg 
actaaaaata 
ggaggctgag 
catgccactg 
agagttatac 
tcaaaggtca 
cattaagctt 
tattttctga 
ctcacaacac 
tcagtggcct 
gccagtgatc 
cctttccctt 
caggctgcct 
ctgcaacccc 
cctcatcctg 
tcaaggactg 
ctccctttcc 
ttataaattt 
tttttgctgt 
ggggctcaaa 
caaaggagaa 
ctcctgcctc 
gaaacgggtc 
aaacacttgt 
ttagtgttta 
aatggtatat 
tcaaaaggcc 
cttccaaagg 
agttacgacc 
gtatttctct 
gtgtccaatg 
cacttatttt 
agtagtagtt 
gccggactgc 
cacgccattc 
cccggctaat 
ctcgatctcc 



gagggaaaac 
tagagaaaca 
gccaaaacta 
cctaaagaca 
taaccaagtc 
tctcctgaga 
taattatgag 
ctgtacttct 
ttgttccaga 
actgaaccct 
tgcccaggct 
tcaggtgatc 
acccagctaa 
cttgaactct 
ggcgtgagac 
gaacaactag 
gtaattttga 
aaaaaggtct 
tgtgtgtgtg 
acaacacggg 
acactgagtg 
acaacaaccc 
ggcctttatg 
atttccttaa 
taaatataac 
ggtcaacact 
tatggtggct 
aggccaggag 
caaaaattag 
gcaggagaat 
cactccagat 
acaggttttc 
attgcatata 
tgaggaatct 
gaaaccattt 
cttcactgcc 
cctaaccaat 
tcattaaaat 
tcattcagaa 
gcccctgcac 
tgtcatcact 
gctgctccct 
ccctggccat 
tactttattt 
atcttgtctg 
tttgtttact 
aggtatgtgt 
tttgtaaaag 
aaagctgcaa 
ccaaacaatg 
cttatcatta 
ggggattttt 
tcataaatta 
atcttacatg 
ttttttttaa 
atgaaaaaag 
tgtgcactgc 
ttagccacag 
tgtctaacag 
cttttttttt 
ggactgcagt 
tcctgcctca 
tttttgtatt 
tgacctcatg 



ccttcctcag 
ccatttggca 
gcaagtgaga 
cttaaaaaaa 
atcaaggatg 
aggacacatc 
gaaatatcaa 
caaaaaagtc 
ttatacaact 
ggacagggga 
ggagtgcaga 
ttcccacctt 
tttttgtatt 
ggagctcaag 
actgcaccca 
taaatgtgct 
taattttact 
ttatgtctgc 
tgtgtgtgtg 
tttgaactgc 
tgcctgcctc 
ctcctcttcc 
atgatccact 
tgacatttac 
atacaaaata 
aggctaatag 
tatgcctgta 
tttgagacca 
ccagggatgg 
cacttgaacc 
tgggcaacag 
aactgcacag 
gagagagaag 
ggatgaagat 
caagataaac 
cccattctac 
ctgcatttct 
atcttagatc 
tagaagcaaa 
ctctcacctc 
gcagttcttc 
ctgcctggta 
gctatctatc 
atcttcttag 
tctcatctcc 
gctatgtccc 
tgaatgaatg 
tttaaagcca 
gcgtatgtgt 
acattgtata 
gctgttaggg 
ttaagctatt 
taattcttgc 
gtttgtatgt 
tagtttaagg 
ttttttcacc 
tctttcctca 
actaacgcag 
tctctgaaac 
tttttttttt 
ggcgcaatct 
gcctccccag 
tttagtagag 
atccacccgc 



agcaggatgg 
accacctagt 
gtttgagaat 
ggaacagtta 
tcaccaccag 
acttcctttg 
agtaacccaa 
aaggtcttaa 
ataaagagac 
aaaaaaaatt 
agcacaatca 
agactaccaa 
ttttgtagag 
caatccgccc 
gccaaaaata 
aacgtgtgaa 
atggttatgt 
aacttactct 
tgtgtgtgta 
atgggtccac 
tcctacctcc 
tcctcctgtt 
ctcacttaat 
ttttctctag 
tgtgttaatc 
tagttaattt 
atcccagcac 
gcctggccaa 
tggtgtgtgc 
tgggaggcag 
gagtgaaact 
gagtcggctc 
aagagagaat 
tatacagaaa 
tgctaaatac 
ctaacctagt 
gctcttgcct 
atggcaccat 
atcccatact 
acctctactt 
acaccctcag 
catgccacac 
tcaaattttc 
cactcaccat 
ctcactataa 
agtgcctaga 
agaactagac 
catactcacc 
gagcaggaat 
tgacaccttg 
taaccacctg 
ctgtgataat 
ttaaaatata 
gctaatgtaa 
aaaatgcgat 
tctgctagac 
gtgtacagcc 
acagtctgtt 
aagaaaatat 
tttgagacgg 
cggctcactg 
tagctgggac 
acggggtttc 
ctcggcctcc 



caacaaataa 
aataaatgtt 
gggatattta 
cttcagtgga 
taaagagaca 
tattcctgcc 
attgaagaca 
aagattaaaa 
acatcggtta 
tgttgacggg 
cggctcactg 
gtagctggaa 
acggagtttc 
atcttgacct 
attttttttt 
ttaaataact 
aaggaaataa 
catatggttc 
cgtatctata 
ttacatacag 
ccttccacct 
cctattcaat 
gaatagtaaa 
ctcattttat 
aagtatttat 
tggggaagtc 
tttgggaggc 
catggtgaaa 
ttgttaatcc 
aggttgcagt 
atgtctcaga 
ccccacaacc 
gaaaagcaaa 
ttctttatac 
acacacacac 
tatctctccc 
ccgtcttttc 
ttgctcaaaa 
acagtctaca 
cctttgccct 
acaggctccc 
ccaggtacgt 
ctcttactgg 
tttctcacgt 
tataagctcc 
acagagagtg 
aaaagcatta 
tagggaagca 
gtaggcagca 
tcccgggagg 
aagccaaaaa 
cccacctacg 
caccaaccca 
acaagctctc 
ataaacaatg 
tataaacaag 
caatgcctaa 
aacatctcgc 
tacatgaggt 
agtctcgctc 
caagctccgc 
tacaggcgcc 
accttgttag 
caaagtgctg 



acatagaagg 
tcaggcaaag 
ctgacactcg 
gaaacttggc 
aataaataaa 
aaaaaagcat 
ttctaaaaaa 
ttaaaaaaaa 
aatgcaatgt 
gagacggggt 
cagcctcaac 
cgacaggcat 
gccatgttgc 
cccaatgtgc 
acttataaag 
actgtatcag 
atacagaagt 
agagaagaaa 
catacacagt 
actttttttc 
ctgccacccc 
gtgaagacga 
tatactttct 
tgtaagacta 
gtaatcaata 
aaagttatac 
caaggcgggt 
ccccgactct 
cagctactca 
gggccaagat 
aaaaaagaaa 
cctacactgt 
tagcctaata 
tatttttgca 
acacacaccc 
ctggattact 
tcaaaaagca 
ccttccaatg 
aggccacaca 
cctcgttctc 
aggtgcaggc 
gcgtggtgca 
catgtgttat 
attacatatt 
agaacaagta 
gaataacgta 
tctgcaaggc 
caactaccct 
gatgtattta 
gaggagaagg 
taacacatga 
tgtaatatta 
tttaatatgg 
acacaaaatg 
tataaatgct 
gatgggaact 
cacagaggat 
agacagtacc 
tttgttatgt 
tgtcgcccag 
ttcccgggtt 
cgccaccgcg 
ccaggatggt 
ggattacagg 
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31081 

31141 

31201 

31261 . 

31321 

31381 

31441 

31501 

31561 

31621 

31681 

31741 

31801 

31861 

31921 

31981 

32041 

32101 

32161 

32221 

32281 

32341 

32401 

32461 

32521 

32581 

32641 

32701 

32761 

32821 

32881 

32941 

33001 

33061 

33121 

33181 

33241 

33301 

33361 

33421 

33481 

33541 

33601 

33661 

33721 

33781 

33841 

33901 

33961 

34021 

34081 

34141 

34201 

34261 

34321 

34381 

34441 

34501 

34561 

34621 

34681 

34741 

34801 

34861 



cgtgagccac 
aatagaagaa 
aataaatatt 
gtttaaaaat 
caacaaactg 
tgaatgaata 
tacatttgct 
gttacagata 
tttctcaaat 
attcttaggt 
aatgctatgc 
taagtacaga 
ttaatgtacc 
gaacacttac 
ttattattat 
agtggcacga 
gcctcagcct 
attttttgta 
cctgaccttg 
accatgtccg 
ttgaatacag 
ctcactgcca 
tatcaaaact 
taagaatatt 
ttagctaata 
acagggtctc 
cttctcccag 
acgcaccacc 
gcccaggctg 
gctgggatta 
cttcttaccc 
cagagcagtc 
cagtattctc 
ttcctgtaaa 
accacaggtg 
cattttcagt 
tgttaaagtg 
agaaatatcc 
gagggtcaca 
ctcttctgca 
gcaagctaaa 
tggcaactga 
tgattcttga 
ctttgacata 
ttaactaaaa 
ataaaaataa 
acagagcttc 
aaccattaga 
taaagataaa 
atgcataaaa 
ccacctcctg 
catctaccaa 
tggacaggct 
tgctgggatt 
gcaaattctg 
ccagtgatgc 
ttcaaaagct 
aatgccaatg 
cttcatactt 
tccttccatt 
taatttacat 
ccaagatgat 
aaataagatc 
acacaaatca 



cgcgcccggc 
acaaatgctt 
agggttttag 
actgcttagc 
ttaatattac 
acacaataaa 
tctctatcac 
tcatgacatt 
ttaacgtgct 
cagcgtggta 
aaacacctac 
cttacaaggg 
tcacccacca 
attagattca 
tattattatt 
tcatctcggc 
cctgagtagc 
tttttagtag 
tgatcctccc 
gcccacaaat 
tacactgtaa 
ctgctcagcg 
aaaatcaaaa 
ctcctatata 
aatgatccag 
actctgtcgc 
gctcaagcga 
acgcctggaa 
gtctcaaact 
caggtgtgtg 
aagatcaaac 
tgtcccacct 
ataaagtatt 
tagggaatcc 
atggtgataa 
gaaactagct 
gttctccttt 
tgttccccaa 
aaatggtgat 
aagagctttc 
tttctttatc 
ggaaggtttc 
actgccccaa 
ccctcattag 
gtttaaaagt 
taatagtatt 
ttattttgtg 
ctgaaggatg 
atcctgccct 
atcacctgag 
ggttcaaaca 
catgcctggc 
ggtctcaaac 
acaggcgtga 
attcagtagt 
caatgccact 
cagagaagaa 
tgtatatcct 
gccaaaaatg 
cacatttcag 
gcatctcctc 
aacaaaagag 
ctggtagctt 
tgggactcag 



cagtagttct 
tatagaacca 
gagaaaatta 
cttcaataac 
tcccaagtta 
tagactatat 
tagtttctcc 
tcacttctaa 
tgtgaatcgc 
ctagcaattg 
cacactctga 
ggttacatca 
aacattatgg 
gttgaacaaa 
attgagacgg 
tcactgcaag 
tgggactaca 
agacagggtt 
gccttggcct 
gctattttat 
aatgttagct 
ttgcaagagt 
gatcataagt 
acaatacctt 
attttcccta 
ccaggctaca 
tcctcccacc 
ctttttggtt 
cctgagctca 
ccactgttcc 
caagttccat 
ttctctccca 
ccacaatttt 
aatggactaa 
tgtcagactg 
ttgcccacct 
tacaatgaga 
aaacctttca 
catctaattc 
tctcaccaac 
aatttttaga 
atttgtattt 
atttggcagt 
tctttgaggg 
aacctacaaa 
aaaatgtcat 
aataagcaca 
acctcatata 
ccacaaaaat 
ggctggagtg 
attctcctgc 
taatttttgt 
tcctgacctc 
gccactgcac 
tccagggtgg 
ggtccatggt 
ttttactcca 
taagcagttt 
tttacccagt 
attaccaagt 
aaaggccaac 
aataaagtga 
ctccccttct 
atttaccttg 



taacttcaga 
taatacctat 
ttatgagccc 
ttgcttcctg 
caatgaaaag 
actgcctaat 
ccaactctac 
acacttcact 
agggggcagt 
tgcctttcta 
gtaacaagtt 
ggataaaccc 
ctttacccta 
atcatctaac 
aatgtcgctc 
ctctgcctcc 
ggcgcccgcc 
tcaccctgtt 
cccaaagtgc 
aataaagtgt 
attacccgtg 
atcatactgt 
caaaccatca 
agctcaccta 
atgtcttttt 
gtgcagtggc 
tcagcctcct 
tttttagtgg 
agtgatctgc 
tggctcctaa 
gcatggtgtt 
gctacccaat 
gtgtctaact 
attcaggtta 
cactacatRa 
ggaataggtg 
attaatctaa 
cctgctgatc 
tattatttct 
tggggaagaa 
gtaatttgct 
attcaaggct 
gcaagcccct 
tttccccaca 
aagcctatta 
atctaataaa 
atgacgctgt 
ttatattgct 
gatctattgc 
caacagcacc 
ctcagcctcc 
atttttagta 
aggtgatcca 
ccagtcagaa 
agtttaagaa 
ccacacctgg 
agttagactt 
ctcattattt 
agtcgtggga 
cacagatgtg 
atcattttga 
gtgtccccca 
caactctgac 
agagcactgg 



aattccattg 
gttcctacta 
tcctacatca 
ggcaaaggtc 
tttaaaccag 
atcaacaatt 
ttttttcctg 
gtgcatgtcc 
cttgttaaaa 
gaaagctctc 
actcagtaca 
atcgtaaagt 
gcctacctta 
acaaaggcta 
tgttgcccag 
caggttcacg 
accaccacta 
agccaggatg 
tgggattaca 
ttaattgctt 
tggttaagca 
atattgctag 
taagtcaggg 
cgaaaatgaa 
tttttttttt 
gtgatctcag 
caatatctgg 
ctgtgaggtt 
cctccttagc 
catcttttct 
tacggccaat 
gactgtagaa 
tattctttta 
gacatttttt 
gcacgtaata 
gagactataa 
ggatgctact 
cttgctggaa 
tctacatata 
ttagttcctc 
gtaagagtct 
gacaaccaca 
tcaaacaggc 
ttctggtaca 
tacaagttct 
gcaactaata 
ctagaaaaac 
gttaggcact 
agtgagtctc 
atctgggctc 
caagtagctg 
gagacacggt 
cccgcctcgg 
tcacctgagg 
ttacaattct 
ggagcaaggg 
tgagtttggg 
ttttctcaaa 
gtacacagaa 
gctgcaacta 
gatatcacca 
aaaaggagaa 
aaaggtgtca 
ccttcacctg 



taaactataa 
atttgagaaa 
gactaccatt 
tggtaaagtt 
acagaaaagt 
gttaatattt 
ctgaaagtaa 
taccacagtg 
ggtagatctg 
agatgatgcc 
gatgtaagtc 
caaaatgcat 
aatgtgctca 
ttttattttt 
gctggagtgc 
ccattctcct 
cgcccagcta 
ggctcgatct 
ggcgtgagcc 
atgtaattta 
ggagcgatgg 
ctcaggaaat 
accatctgta 
taaYttctta 
tttttttgag 
cttactgcaa 
gactacaggc 
ttgccacgtt 
ctcccaaagt 
agcttttctc 
ctcagtaatc 
gatgtgaagt 
tcccccatat 
gctagaatat 
caggctgatc 
ggtcactcca 
tagacaccct 
ttatttcatt 
ttagccagca 
ctttaaaaag 
cctccagtgg 
gtctttttct 
tcctgaatct 
aaatattctt 
tatattcttc 
aactttacta 
tccacccatc 
tttcaagaca 
aaaatttaat 
attgcaacct 
gtattacagg 
ttcaccatgt 
cctcccaaag 
gatataaaat 
cacaaactcc 
tttacatgga 
ttcaaatatg 
acaatgtaat 
aattaccacc 
aaacgtaaca 
gcaaagactc 
aagagtaaag 
caatccagcc 
gggcacactg 
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34921 
34981 
35041 
35101 
35161 
35221 
35281 
35341 
35401 
35461 
35521 
35581 
35641 
35701 
35761 
35821 
35881 
35941 
36001 
36061 
36121 
36181 
36241 
36301 
36361 
36421 
36481 
36541 
36601 
36661 
36721 
36781 
36841 
36901 
36961 
37021 
37081 
37141 
37201 
37261 
37321 
37381 
37441 
37501 
37561 
37621 
37681 
37741 
37801 
37861 
37921 
37981 
38041 
38101 
38161 
38221 
38281 
38341 
38401 
38461 
38521 
38581 
38641 
38701 



accatcctct 
tactgataga 
ggtgggggat 
caagtcccat 
acaatttaac 
actgttatgg 
agtgtgatga 
ctaatgaatc 
ctgtcatgtg 
agaactcaac 
aagtttataa 
ccaacaggca 
actacagctc 
tgcctgtaat 
caagaccagc 
gaagaatata 
aaactttctg 
aaagaacata 
aatgttagct 
aatctttata 
ctttataaaa 
taaaaagagt 
taatagtgaa 
gaattaggga 
agcatttctc 
cttgatagtc 
gaactccaaa 
caaacaattc 
aaaacttccc 
gaactgaggt 
caaaattttc 
agtaactacc 
gaggcatagc 
gctcatggtc 
actacttctg 
gttggaatct 
ggatctttac 
ttctttgtac 
ttagcctttt 
cctcctttcc 
taaatttatt 
taaagggcca 
tgacctctcc 
agtcttgctc 
tgcccaccgg 
acctgccacc 
ggccaggcta 
ccaaagtgct 
gaaactccag 
ggagttatta 
tatgattttt 
gcttttatta 
tttaaattat 
caaactgaag 
ctgttcctac 
tccctcttat 
ctcaaatcct 
ctcattttgt 
ctagagttag 
tgcctcctta 
cacactccag 
cgtcgagctt 
actattactg 
tagtcactac 



gcagatgctt 
catcatactg 
ctacgcctct 
ctataaaaag 
agttcacctt 
actgaatgtt 
tattaggagg 
gaattactgc 
tggacacaat 
tgtgctggca 
gccacccagt 
aaggtgatta 
tggggaggca 
cccagcactt 
cagggcaaca 
gcagatttgt 
taaaataaca 
acaacactca 
atcattacta 
aaaataaaag 
accaggatgt 
tcacaagaat 
gtttaaatta 
cttgagagag 
agacatggga 
gtttcaaaga 
ttacagcctc 
ctacatatcc 
aaatagaagt 
taaagagggt 
tgtaaaaggc 
caccccagtc 
tgtgtttcag 
tgtagtttgc 
gccaggtttg 
tcctccaaat 
cacaccacaa 
actgtccttg 
gttatagcta 
taactagcgc 
cacctcacct 
ccgctgactc 
gtagactgta 
tatcacccaa 
gttcaagcaa 
acacctgact 
gtctcaaact 
gggattacag 
gctttcttcc 
tccaccagaa 
ttgccctaaa 
acagctattt 
gtactctgca 
tcatcaccta 
acctattttg 
actccataac 
gctcctcttc 
gctgagtcct 
ctttctaaga 
gaacctacct 
ctgcagccac 
actgcttctg 
cttctccaag 
ctgagctttc 



cagcaacatc 
caacctccac 
aaagtcaacc 
gaaccaagag 
ttgcagtggc 
tgtgtctccc 
tgggaccttt 
cctcataaag 
gacaacccag 
tcctgatcgt 
ctgtgctact 
cataactgga 
aaaaaatttt 
tgggaggctg 
tagaaagacc 
catttgttag 
gcaattgccc 
gcaccacgga 
ttaaagcaga 
gatgtttcag 
attttcaagc 
gaagtgggaa 
catcaattcc 
ctggtcaaat 
gacagacgta 
aattatactc 
cagggacagg 
aaatcatact 
taaatataaa 
agagttaact 
ctaatagtat 
cttgtggcag 
taaaacttta 
caacctttcc 
gaatgcagat 
tctatgctct 
atggtatgta 
acgcaagcct 
acatttccta 
tccctctcct 
tcactcctcc 
ctacttgctc 
acttaccact 
gctggagtga 
ttctcctgcc 
aatttttgta 
cccgattata 
gtgtgagcca 
taccttcttg 
acagtgcctg 
tatccaagta 
agttgggaat 
tttccatcca 
ctccgtccca 
tatcactatt 
caccagtcat 
tccatcccgc 
gactctgctg 
gatgaaatgg 
cgaaataaag 
actcacaacg 
cttagaacaa 
ggctccctaa 
agtcatcgtg 



tcctcagtga 
ctggagtcct 
aacaaaatga 
ccctaatttc 
aacaattggt 
caaaattaat 
ggaaggtaat 
gagatcccag 
cagtctgtaa 
ggacttctac 
ttgttacagc 
ttctgatatc 
ttttaaaaaa 
aagcaggagg 
tcatctttga 
caagtagggt 
ttcttcataa 
caatgcacag 
aactgaggca 
ggctactgag 
tcataataga 
aatattaact 
ccccaaaaat 
ctacacaact 
cctcgcatag 
caagtaaatg 
atctggataa 
cagaaaaaca 
cgtagagctt 
gaggttcagg 
tttaggcttt 
gaaagcagcc 
tgtacaaaaa 
tctagagtta 
acacagtgtg 
tctttcatac 
tgccagctga 
taattaagtc 
tcttctactc 
tccctttacc 
atccctgttg 
tagcatacgg 
tccttttttt 
agtggcgcaa 
gcagcctccY 
tttttagtag 
taaattatac 
ccgtgcccgg 
actactccac 
acacatagta 
catgcctgga 
ctaatctcac 
aatgtcctac 
cctcccaaaa 
tccctgctgc 
tgtgttgtat 
ctctcatggc 
cgccctctcc 
actacaccat 
tacaaacctc 
tgccattctc 
ctaaactgtt 
atgtttttcg 
ctaattctct 



ggtcattctc 
ggtctgtccc 
aacatcagtc 
aaacacttgc 
attctgaaga 
atgctgagac 
taggtcatga 
agagccctct 
cctggacgag 
cctctagaac 
accacaaact 
agtcaggcct 
aagctgggtg 
attgcttgag 
tttaaaaaaa 
aacctccata 
aattgttagg 
gaagcattca 
cagctttaat 
tgctatctat 
ctcacatctg 
gtaaatgaca 
taactcaatt 
gtcgtcatgt 
cctaggtttg 
ttgggatcaa 
gaaataggga 
gaactctatc 
ctagtcgtat 
gccctagagc 
gtcagtctga 
acaggcaata 
caggcagcaa 
aaaagatcgg 
tatatatgtt 
tctccttttt 
agctcctctg 
tatggtttct 
cagtaatctt 
cccaagcttt 
tgccactaaa 
acactttcag 
tttttttttt 
tctcagctca 
gagtacctgg 
agatagagtt 
ctgcaacctg 
cctgctccct 
agtgttgtgc 
ggcactagta 
taatctcact 
ccctccactc 
aagcatctca 
catccagcac 
ccaaatagga 
caattccacc 
ctctgttcag 
aatccatccc 
tcctctgctt 
aaattgtatg 
caaatcattt 
aaggactatg 
aatgaataaa 
tactataaaa 



aggcagaaaa 
agcgtcacat 
tcttgtggtg 
aacaataatt 
aaagtaatgc 
cctcactccc 
aggtgaagcc 
agccttcttt 
aaccttcacc 
catgagaaag 
aagacaaaca 
agagtaaaat 
cagtggctta 
ctaagaagtt 
aaaaaaagaa 
acccctaggg 
attaaatggc 
agtgttctca 
taaaacaatg 
atttaaagtg 
gctcttcaaa 
gtccaaatat 
ttactacaga 
atattctaaa 
ctggggcaaa 
tatctaaatc 
agttactaaa 
tactcttaca 
actatttcat 
agggcagagc 
tggtctctgc 
cgtacaggaa 
gctggatctg 
ctattggttg 
tcaatatcat 
ctgcacttat 
ttcataattt 
tatcaggtaa 
ctggattctc 
aagagtagtc 
gtgactctca 
tcagttctct 
tttgagacgg 
ctgcaacctc 
gattgtaggc 
tcgccatgtt 
cttcggcctc 
tttaaaggca 
ttattcctca 
aagtatttgc 
cactgccaca 
caggccctta 
gactgtgtcc 
caccataacc 
aacctatgct 
ttctggatat 
atcctcattt 
cacactactt 
aaaaactcta 
caaggccctc 
ttcttcatga 
tcttgcctta 
catgttctcc 
tatttctgat 
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38761 
38821 
38881 
38941 
39001 
39061 
39121 
39181 
39241 
39301 
39361 
39421 
39481 
39541 
39601 
39661 
39721 
39781 
39841 
39901 
39961 
40021 
40081 
40141 
40201 
40261 
40321 
40381 
40441 
40501 
40561 
40621 
40681 
40741 
40801 
40861 
40921 
40981 
41041 
41101 
41161 
41221 
41281 
41341 
41401 
41461 
41521 
41581 
41641 
41701 
41761 
41821 
41881 
41941 
42001 
42061 
42121 
42181 
42241 
42301 
42361 
42421 
42481 
42541 



tataggctgg 
cgaatcacaa 
ctaaaaatac 
gaggctgagg 
gcgccactac 
gattatataa 
atgaacacat 
tctgagtagg 
taccaagtgc 
gtttccgagg 
agtcgaccaa 
ccaaaacaaa 
tgttcacatg 
agttaaggct 
tttcgcaatt 
gaaagctgtc 
aagggcaaga 
tctaaatggg 
ctgaggatga 
taatctttga 
aattaagatt 
aacatgagac 
ctgacaaact 
gctctgtcac 
ccgggttcac 
caccacacct 
ggatggtctt 
ttacaggcgt 
acacctgtga 
ttgtgcccct 
gcttggaaag 
ctatcaaagc 
taagaaagaa 
aattgccatc 
aacagaagca 
atcatcttct 
ttaaggacaa 
tctgttttat 
gaaataggca 
acacaggggt 
ttatatttat 
actggcctag 
tcttttcctt 
attaaaaatt 
gccattttac 
gccataaaaa 
atgaaattta 
tatgtgtgac 
gctgaattct 
cacgcacaca 
aaggcaggaa 
ttatatatgg 
gacaggcgga 
catctctaca 
ccaggatcac 
tgatggagtg 
attagattaa 
aataaaaaga 
gagactaaat 
actaacccag 
atgacatcta 
gagcactata 
agggttcttg 
attctcaggt 



gcgcagtggc 
ggtcaggagt 
aaaaaattag 
caggagaatc 
actccagcct 
attatacttg 
tccatattcc 
agagtaggat 
atgtactctg 
aattggaaag 
aacataaatt 
tagaaaaaaa 
cccacacata 
gctataacaa 
ctagagtctg 
ttataggctg 
aacatatctg 
ggcccatgac 
gtatttcaac 
tcaggaatag 
tactatgtca 
ctggccccaa 
ataaactgta 
ccaggctaga 
gccattctcc 
ggctaatttt 
gatctcctga 
gagccaccgt 
aaccattacc 
tcctaatcct 
ttttttgaat 
taaggaaatg 
tgactcaaat 
acgactcaaa 
cttctcttta 
aagtaatata 
ctaagaccct 
cactggtgat 
tcactactaa 
tcctaactgg 
atttgagtcc 
gacatagcag 
caaatgttgt 
gttttaatag 
taagtctcct 
acttggcctc 
acatgcttta 
atcggtatct 
tccatttcta 
cgtgcacaac 
taaaaataca 
gctggggcgc 
tcacttgagg 
aaaaaaaaat 
ttaagaccag 
agatcttatc 
gacagaattg 
agtataactt 
gcagatagat 
agaattttaa 
taatagagtc 
atttttttgt 
gtaaagttca 
aaaagttatt 



tcatgcctgc 
tcgagaccag 
ctgggcctgg 
gcttgaaccc 
gggtgacagt 
caacaaacaa 
aaagagtaag 
agccatattc 
ctctggtcta 
gagagtctgc 
actgagctga 
aaaccttcct 
tacatggaaa 
agtaccacag 
gtggttcatg 
aagactccca 
gggtcccttt 
ctaatcactt 
tgtgaatttt 
cagatgcaca 
aataatgttt 
ggagttcagt 
cataaagtat 
gtgcagtggc 
tgtctcagcc 
ttgtactttt 
cctcgtgatc 
gcccggccca 
acaatcaaaa 
tcctcacaca 
aataaaatgc 
ggaaggaaaa 
attaattgat 
aaaaacaaaa 
cacagaccac 
acagtgggaa 
ttctgcatag 
gctgacatct 
tttgttgagc 
gggcctgaga 
atagacttga 
gcgttcaaca 
cttgctgtaa 
atctagttaa 
atgtaagagt 
ccctgaggaa 
ccttctgctt 
cttgtaatta 
gtctattaac 
cacatgcaca 
caaaatagta 
agtggctcat 
ccaggagttc 
gttaaaaatt 
gaatttgagc 
tctaaaaaaa 
tttaccaaat 
catacaaggt 
tgtaaaagat 
aaactttcta 
cttaatactg 
tgtttgcgtg 
atactatagc 
aaggccccac 



aatcccagca 
cctggccaac 
tggcgggcgc 
gggaggcaga 
gcgagactcc 
tctcaatagg 
agggaggaga 
tggacaggct 
aggatgaaaa 
agtccagaag 
tccagtgtgc 
taaataacca 
aagcattctt 
gtaaggtggt 
atagcggtgc 
acttcttgtt 
tataaggcca 
cccaaagact 
gtgaagacac 
gtgaaattcc 
tatcttagtt 
tttcgaggta 
aatttttctt 
gcgatctcag 
tcccaagtag 
agtagagacg 
cgccagcatc 
taaagtatac 
tagcaaatgt 
cttcctctct 
taagctaaaa 
ggtagcagag 
aaatgatcaa 
gtgccacact 
tcacttcctg 
ctaaaggcag 
aaatcaaggc 
ctttatccaa 
tttaatgaga 
gtctgagttt 
gcagtctatg 
aatataaaYc 
tgtcattagg 
aaaagagaag 
ctgaatagga 
ctggcatgct 
tagctgcaaa 
atactattta 
taccccccta 
caatttccaa 
agaaagatag 
gcctgcaatc 
aagaccagcc 
agccaagtgt 
catgatcaca 
aaaataaaaa 
tccagaatat 
gaaatgcatg 
tatataaatg 
aggctgaggt 
ctggaggcat 
acagagtcca 
taaacaacaa 
taaagtaaga 



ctttgggagg 
atggtgaaac 
ctgtaatccc 
ggttgcagtg 
atctcaaaaa 
ataattttcc 
agacagaagg 
gtatgcctga 
ccagtccact 
aaacattgtc 
atgtcctatt 
gggaattctt 
gaaaaatata 
tataaacaac 
caacacagtc 
gtatcctcac 
ctaatcccat 
ccacttccta 
aaacattctg 
caagtgtttc 
tctaagacca 
ccttttaaat 
tttttttgag 
cttactacaa 
ctgggactac 
aggtttcacc 
agcctcccaa 
ttttaagtgt 
atgcatcacc 
accatccccc 
ggtaacagac 
tagtttcaat 
gtcaacagtt 
ttattaggaa 
agccatgtgc 
tccaagaaat 
ccaagatgct 
gttctgcttc 
attaaccata 
ttgaagggcc 
gctccaggag 
cttatttcac 
aagaaacaYa 
aagaaagaac 
agctgtcttt 
ccttcctttc 
gaacttgggc 
ttcctttcct 
cacacatacc 
acctctatgg 
tgtgaccaat 
ccagcacttt 
tgggcaacag 
ggtgacatgt 
cgaccacatt 
ttttaaaaat 
tataatatag 
tattggtcta 
gcagactctt 
tgaggttaag 
agcacagggc 
aagaataaag 
tgaaaaccca 
aaatgaaaag 



ctgaggcggg 
gccatctcta 
agctacttgg 
agccgagatc 
aaaaaaatcc 
ttaaagtttc 
gttacctctt 
agtccccaag 
ggcagtgcct 
atgaatagga 
ccaaatcttc 
ttcatctcta 
cgattgtctt 
agaaatttat 
gggttctgat 
atgacagaga 
tcatgagggc 
atataacaca 
tctgtaaaaa 
aattaatttg 
ctaggacaaa 
tgtgttataa 
atggagtctc 
gctccgcctc 
aggcgcctgc 
atgttagcca 
agtgctggga 
tgatatacag 
ccaagtttcc 
actcccatct 
tcaagaggtt 
ttgataagca 
tcaaagagcc 
gggcacagga 
agttctggcc 
aaaatgataa 
gccagctagg 
ctcagatgta 
tggaaaccaa 
ccatgaaatt 
cttaagaacc 
ccatcctctc 
ctgtttgtca 
tcatctatgt 
tgaattcagt 
taaggaagtt 
aaattcgacg 
tttttcctaa 
ccacaacaca 
taggggaaag 
gcctctaata 
gggatgccca 
agcaaaaccc 
gcctgtagtt 
ccggcgtggg 
ttacatatga 
gcttaggaca 
gggcatcata 
catagcttaa 
gggtttagtg 
tggatgaact 
ttgatattat 
ttaaacatat 
gtgggaattc 
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42601 
42661 
42721 
42781 
42841 
42901 
42961 
43021 
43081 
43141 
43201 
43261 
43321 
43381 
43441 
43501 
43561 
43621 
43681 
43741 
43801 
43861 
43921 
43981 
44041 
44101 
44161 
44221 
44281 
44341 
44401 
44461 
44521 
44581 
44641 
44701 
44761 
44821 
44881 
44941 
45001 
45061 
45121 
45181 
45241 
45301 
45361 
45421 
45481 
45541 
45601 
45661 
45721 
45781 
45841 
45901 
45961 
46021 
46081 
46141 
46201 
46261 
46321 
46381 



acgaagaaaa 
cagctctcca 
ctctcagtca 
cataaacata 
caaaatatag 
agttgtattc 
atcatgcaaa 
cacagaaaat 
catatcaatt 
tccttgaggt 
agtgatgcag 
caagaggtat 
tctggtagca 
cctgtggggc 
aattctgaag 
accagctgtg 
gtctaatgga 
ccacaccagg 
gtttgcttaa 
caacacctcc 
actctgtccc 
caatctaact 
tcctacccca 
caagtttcac 
ttaaatatag 
ggaactgtca 
atactaaata 
cacttttttt 
aatcaagaat 
ctagaataat 
taacttaatg 
tctagagaaa 
tggtaccaga 
aaatattaag 
cgtgtttttt 
attatcatga 
aattacacta 
ctccaagtct 
atcagatttc 
tagtctacat 
aggtactcat 
aattcataaa 
ttttagaaaa 
aacagtatac 
ttcatggtaa 
aactcaaaaa 
ggaaactaaa 
aaagcaggta 
tcactctgac 
ccaggttcaa 
caccacgcct 
gctagtctcg 
attacaagca 
tttctttttc 
tgcagtggcg 
cctcaacctc 
gtatttttag 
tcgtgatccg 
caggccaact 
gacaaaatgg 
tatttttcta 
ctccagcaag 
gcttaggcag 
gatattgccc 



cttgcctgca 
ttaccccttt 
ctaacttgaa 
aaagagatag 
aaatgatttt 
tgatactgtg 
atagcttcct 
tcaataagac 
tgcagaaatc 
aaatatggtt 
cccatccagg 
ttcaaaaggt 
gcaataacaa 
tactaacaat 
tcagcgtagc 
tgattttagg 
gactgcaaag 
cactgcacat 
ggatcaaact 
tccaggcctg 
cacttctctg 
ctcaaataat 
ccccccatcc 
ataagcagtc 
atgaaaagga 
tgttttattt 
acttaataac 
tttaatacga 
gagtagcctg 
cttaacctct 
tttttttaaa 
attgtaatag 
aagacatttt 
gaatggccaa 
ctttttaaaa 
ttattttatt 
aaaatgtttc 
taaaaattcc 
tatcctttca 
cttcagtttt 
ggactaggca 
ggttatccaa 
tgttctaata 
agaaccttta 
acttttctta 
tacttggaga 
atacattgtt 
tctcttggag 
acctaggtgg 
gcaattctcc 
ggctaatttt 
aactcctgac 
tgagctaccc 
tttttttttt 
caatcttagc 
ctgagtacct 
tagggactgg 
cccacctggg 
ctacatttta 
gacattctct 
gaaaatatat 
tacaatgtac 
atataagcat 
atttaacagg 



tagtggtgac 
ttaggtcaga 
aacatcctat 
agatatcaaa 
tactaaaata 
agtgtatttg 
ctggaaaggg 
ctacaacgag 
atattcataa 
attgctccta 
gtcatccagc 
atgctgaaga 
tgatctctgg 
catgaatgtg 
ccgaagagac 
cacgttcctt 
gaaggtgaaa 
aaaattgtga 
gagtccataa 
tcccatctgt 
tgatgtgttt 
cctttaatta 
tacagtcagt 
ttgggggttc 
attgatgaaa 
tagaaggtaa 
acattcttat 
aagctccact 
tggaagaaca 
atatattttc 
aaaatgatag 
ttttaacaat 
acattttaaa 
tagaatctgg 
agaaacaaga 
agaaaagcaa 
atttattgcc 
cacccgtatt 
ctgttcacag 
tcctttttgt 
ctctagttgt 
ggcttctgta 
atcctcatgt 
aaagccctgg 
accttccctt 
aacaaaactt 
aaacaactga 
tatattttaa 
agtgcagtgg 
tgcctcaacc 
tgtattttta 
ctcaagcgat 
tgcccggctg 
tttttttgag 
tcactgaaat 
gggattacag 
gtttcactat 
cctcccaaag 
aatgtttata 
caacattctg 
aatccttata 
agaaatcacc 
gcatatttcc 
tcttatatat 



atttgtcaca 
gactgccaca 
cttgattttt 
ttctaatgct 
attctttgca 
gaaaataaga 
cctattgagc 
ataaactttt 
atgaatgtgg 
tttcactgct 
cagtgaatgg 
caggctcctg 
aacagagtcc 
ctggtgttaa 
acaggtggct 
aaatcccctt 
tcaatactga 
accacaggca 
aaaagatagg 
ctacttgcct 
ctgccagctg 
ggttgctttt 
atctgtctgc 
aattctaaac 
aaagattata 
ttagagtcag 
ataccatgtc 
ggatgattta 
gccaaaaaaa 
cctaagctct 
aataaaacgt 
gacaaatgaa 
tggaccaaac 
gtagggaagt 
aaaacatgtt 
attatacata 
cttcaagttt 
ggcagtatca 
cattcaactc 
tctcattttc 
tgggcaccat 
attatcactt 
ctatctccca 
aagagactga 
aacctatcac 
tggaaagaca 
tgacttgaca 
ctgatttttt 
cgcaatctta 
tcctgagtac 
gtagagacgg 
ccgcccatct 
atttttcttt 
acagagtctt 
ctccacctcc 
gcatgtgcca 
gttggccagg 
tgttgggatt 
aaataggaac 
tcactgtttt 
aaaggtgcta 
actgctaaca 
gcatagatgg 
ctttccatag 



gggtgccctt 
ctcaagaaag 
taaaatttaa 
catactaaat 
cttaccggag 
aaagtcacat 
tcgaagtatc 
cattacaaca 
aatttaatcc 
aaagacaatg 
tagagctcag 
gaacccagag 
ctgaatttaa 
agaaccataa 
gcaatttaat 
aagcctcaat 
caaaaatgtc 
ttctggaaaa 
ttataacaag 
ccatggtctg 
cacaccagct 
actcctgtgc 
atgaacagca 
aaacagacaa 
attcacccga 
tgggcggtaa 
ccaaaaaggt 
aaaccaaaca 
agacacaaat 
gtgatgctac 
tacaatttca 
gatagcatat 
tccagctgga 
ctggaaaaag 
tatattttcc 
aaactatatg 
ggtcccatcc 
gtctgctttc 
acttactatc 
cctatttggc 
gatcaaaact 
aaatttgccc 
tgacatgagc 
agccatcaac 
ccaggatgag 
gtacggaaac 
aaataagtag 
tttttttttg 
gctcactgaa 
ctgggattac 
ggtttcacta 
cagcctccca 
tttttacact 
gctctgtcgc 
aggttcaagc 
ccacgctcgg 
ctggtctcaa 
acaagcatga 
acacactaga 
gctttccctt 
aataaacaga 
gttgggatat 
aaaaagatat 
caacacagat 



ctccccagct 
gcctgagata 
gacataagat 
accagacccc 
aaaaggaaac 
tgcttgaaat 
taaatttccc 
cttgtactct 
tcagaaaaac 
tggcttaaaa 
aactaaaatg 
aaacttcccc 
agatgacaat 
ccgtctagtg 
accggtgttt 
ttcctcatct 
caatgagcac 
gagagctttt 
aacagttttt 
catctcactg 
catctttttt 
ggtcctggat 
gaaaagatcc 
aagatttaga 
gtcccatagt 
aacatttcag 
caattttcaa 
gaaccaaaaa 
agtgttaaga 
attacattca 
tttctaaaac 
ttcctattta 
acagaggtta 
tttgtgtgca 
aggccttccc 
gatttcagta 
atcatgcatc 
ctagaaatta 
ctgccttttc 
aatatttata 
gacacaaaca 
tttgatatta 
cttccatcag 
tagagaagcc 
gatgaatcac 
acagaaccat 
gggggtatta 
agatggagtc 
atttccacct 
aggcatgtgc 
tgttggccag 
aagtgctggg 
tgtccttctt 
caggctggag 
aattctcctg 
ctaatttttt 
tctcttgacc 
gccaccacgc 
aaaaaaaagg 
aaaatggtaa 
aagtcaactc 
atcttttagg 
aaatatgttt 
aatgctagct 
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46441 
46501 
46561 
46621 
46681 
46741 
46801 
46861 
46921 
46981 
47041 
47101 
47161 
47221 
47281 
47341 
47401 
47461 
47521 
47581 
47641 
47701 
47761 
47821 
47881 
47941 
48001 
48061 
48121 
48181 
48241 
48301 
48361 
48421 
48481 
48541 
48601 
48661 
48721 
48781 
48841 
48901 
48961 
49021 
49081 
49141 
49201 
49261 
49321 
49381 
49441 
49501 
49561 
49621 
49681 
49741 
49801 
49861 
49921 
49981 
50041 
50101 
50161 
50221 



ttaccatttt 
atctcctatt 
aaataaaata 
aatctatttg 
ttaaactgta 
accatgtgag 
ttcttcttca 
ctggatttcc 
tacagtgcga 
aaaacctggt 
attttaagtg 
gaccataagg 
cattacaatc 
ccagatgagt 
atccattaaa 
cctaaacagc 
ttcctcccca 
tcaattgaat 
atgtttgtgc 
acttcatgaa 
ggtttagagt 
acacattttt 
ttcagtagat 
tgtaggaaag 
tgctgagtat 
ggtgttaaga 
ctgttcgagt 
ttaatacatc 
tgttgcttgt 
ttcaggtttt 
atataccaag 
cacatttaga 
cacagttcat 
ctgtagtacc 
ctacactgtg 
gaaattactt 
cttcactgca 
taatttaaaa 
gtcaagtgaa 
ctagctatag 
ttgttcttcc 
tttagaaatc 
gggatggccc 
tctttagaat 
ttatctaaag 
catgattatc 
tttctttaat 
tggtcttgaa 
ttacagacgg 
tctcttcacc 
tttgagaact 
ccagtcttag 
gcaatctaac 
gtttgattcc 
tagtcaccta 
aggcaagtaa 
gctgccatca 
aaggctacac 
tcatctcctt 
acgaaaaaat 
gcacatcata 
aaagtcaaga 
aaacagattt 
tttaacttca 



aaaagctaca 
aatgaacatc 
gttctgattt 
agagagtttt 
cctaaaaagt 
aatacacaat 
agaaatatac 
caaaagctac 
accaggcaga 
cacttgacaa 
attttttttc 
tgtccttttc 
atctggaggg 
taaatcaaaa 
tactatgcag 
attctagcat 
gagggaaata 
agaatttaca 
aattcattcc 
ataacatgcc 
tataaataac 
gttgggtgga 
actgccagtc 
tgttccaatt 
atagtggtat 
aacctttgct 
cttttacaca 
atagatatga 
cttttcactc 
tcctttatga 
atcatgaaga 
gctaaaaatc 
tgtttcatat 
ccatgttctc 
gactgataga 
aatgagtaca 
caatatatcc 
ataccaccat 
catatgtgtt 
acaccaatcc 
tcctcaaatt 
agcctgtcaa 
tgaatatata 
tttgtaatct 
aatgttgaga 
atcccagaat 
tttatttctt 
ctcttggccc 
aaccacagca 
aatatatctt 
cctaacatag 
ttatctctaa 
atccattgta 
tggctgtgca 
actggcatac 
tgtacttcct 
gttggctgct 
aataaagcat 
attgaaagaa 
tagtgggaag 
agcattacgY 
caacttaaac 
agcaaaagaa 
cctctaaggc 



gagtattcca 
taggctgtta 
tgtggaagac 
tgctgtgaca 
cttgttctag 
gattaggacc 
ctgaatttca 
aggaaaatga 
agccaggaaa 
aataggtagg 
ttttcattac 
ataagaaatt 
gattaaaaaa 
tccctggggc 
agaaatacag 
cttagtggtc 
gggtcctaag 
gcacaatttt 
agctgctata 
acaattatct 
tctaaacatt 
acctgtgagt 
ttccaacaat 
gcccagaagt 
ctcattgtga 
ttgcttattt 
tattaaaaat 
gtattctgac 
tgttaccttt 
ttagtccctt 
tacattccca 
aagctgatgc 
gttcatctac 
tcacttataa 
cattgaagac 
atgtacataa 
tgtacctaca 
ctccccaccc 
tgcttctaga 
taacacaaat 
tatcttgact 
tatccacaca 
cattaattta 
atgaatttcc 
tgtattatgt 
acgtaatgtt 
aacgttttgt 
caagagatcc 
cccgaccctg 
ttataaaaag 
tgtttgataa 
cctaacatct 
caattttttt 
ggattttgtc 
agtaaatgtt 
ctgggcttca 
gaaagtatca 
catttattaa 
aagaaagaaa 
aaaggaacta 
tgagaaagag 
agctcagtct 
gacacatact 
atatggaaaa 



ttctgggtat 
ccattatagc 
tattccagaa 
gtgtaacaga 
tctaagatat 
aaaaattaat 
ggaataatta 
tgcctcatca 
taaaagggag 
ggtattaaga 
acatttcttt 
ataatacagt 
aaaacagaac 
taaggtccag 
aaagtttaca 
cccctgtggc 
ttcaaacact 
tgtgtgtttg 
ggtagctata 
gttctactgt 
ttctgtgcat 
agaattatca 
gttttgccaa 
ctcaccaaca 
ttttaattcg 
gctatttgga 
ggaactgtcc 
agatacgttg 
agattaacag 
ttgtgtcatt 
tgttaacttc 
tgatttttac 
ttgacagcaa 
atgggagcta 
tgaggaaggt 
ttccagtgat 
cttgtgctcc 
tactgctttg 
gtctcaattg 
attacaactc 
atttctgatc 
cacacaaaaa 
ggaagaaatg 
gatattcctc 
tattttctct 
tcttgaggct 
agagatgagg 
tcccccactc 
ctttttcttt 
ttctccatgt 
acaagcaaag 
gtgtgaaaat 
cagaagattg 
atttttgatg 
tgttaaatga 
gttccctcaa 
aacaaatggg 
tttccattag 
gaaagacaat 
acatgtacag 
acttccgaag 
ggtagaaact 
caaaaaatat 
ggatttatta 



gtaccataat 
atgaataata 
agcgtatgtg 
tgacctttga 
ccagaaataa 
ctgcagtctt 
caactttcat 
tcttcctttc 
ggacactgta 
ggaagtatct 
atttcttaaa 
gattatacac 
actgctgcct 
gtatcaatta 
cacccatgaa 
ccctcccaat 
aggaatttat 
gcttctttaa 
actcattacc 
taataaacat 
gtcttttggt 
ggtaataggg 
tgtataccgc 
cctggtatca 
cacttcccta 
tatcctcttt 
ttcttcaatt 
cgaatatatt 
ttttcatttt 
ttaaagaaat 
cagaagtttg 
acatggtgta 
ccttcactaa 
tgtaatgtgt 
gggagggggc 
ggatacacta 
ataagtttat 
cagttccacc 
cattctactg 
tcttaactcc 
ctttacattt 
aacctgctgg 
atacctttaa 
tatttcttta 
gtagaggtct 
attttaaatg 
tcttgctata 
agtcttccaa 
taaatgtcta 
aattctaatg 
gataaatatg 
actttggcag 
ctttaaccaa 
ttccaacgtc 
tgcctgaact 
agtaaaattg 
aaacaacttt 
ctttcctgtg 
caggcactca 
ggcattcaca 
agtgacaaca 
catttaattt 
cctgaagaag 
gaagcatttt 



tattcaccca 
aattaataag 
tcaggcaata 
ccttaaatgg 
aaacaaatag 
tgcttctgaa 
ctgttatttc 
ctcaaatgcc 
tacattatgt 
ctgggggtgt 
tttctatagt 
cctggctatc 
gggaccctct 
aaaaacaaat 
ttaaccagca 
cacatcttct 
ttggacgaat 
aatcagcatt 
ataattgtat 
ctgtatttct 
acacgcatat 
tattctcatc 
atcaacttac 
ttcatttatc 
attactcatg 
gtgaaacagt 
ttaggagttt 
ctcccactct 
agcagtcaat 
ctttgtcaac 
ttttatcttt 
agatagaggt 
aaataccacc 
acacatgggc 
ctgaagatga 
aaagccctga 
acaaataaaa 
tttcacataa 
gcttatttgt 
tgctccaaat 
ccatataagt 
gattttgatt 
aatatcttta 
gtttctcaat 
ttacatatcc 
tactttttct 
ttgatcagac 
agtgctggga 
tttctagctg 
tttagctggg 
tatttttttc 
tgtaaccctg 
accagttgtg 
taacacagta 
atgtgacctc 
aggtaactgt 
taaactgccc 
cttcacacat 
gagcagccct 
gaaattccag 
aaagatactg 
ctgctcataa 
ctaacaggaa 
atgaggcata 
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50281 
50341 
50401 
50461 
50521 
50581 
50641 
50701 
50761 
50821 
50881 
50941 
51001 
51061 
51121 
51181 
51241 
51301 
51361 
51421 
51481 
51541 
51601 
51661 
51721 
51781 
51841 
51901 
51961 
52021 
52081 
52141 
52201 
52261 
52321 
52381 
52441 
52501 
52561 
52621 
52681 
52741 
52801 
52861 
52921 
52981 
53041 
53101 
53161 
53221 
53281 
53341 
53401 
53461 
53521 
53581 
53641 
53701 
53761 
53821 
53881 
53941 
54001 
54061 



attatatctt 
aagcaaatgg 
ctttttttgc 
ttcttaccca 
ctaatacttt 
actcttcctt 
accaaattct 
caaaaatggt 
gttatcttct 
tattagcttc 
ctaccttggt 
caagtccaat 
gatgttaaag 
atccatctca 
atctgccacg 
ctcatcctgc 
tttgtatggt 
tatgcaagtc 
ggagaatatc 
acagaatgac 
atcatagcaa 
ttttttaact 
aagttctgga 
tatctcaaat 
ctattttgaa 
tattcttatt 
cacctgggtt 
gaggaagatc 
ctggcacaca 
aaaaaaaagt 
agtttgcaca 
atattatgtt 
gtttcatgga 
ttaacaaacc 
atcccattct 
ggctcatgcc 
agttcgagac 
agctgggtgt 
ttgcttgaac 
ctaggcgaca 
aaaaatggtg 
gaatttgaag 
ctattaaatt 
ctgcaatgat 
ttcctgttgg 
caaatcgatt 
caagcttttg 
atacacattc 
gtgagtatgg 
tttaaaaata 
agtttgaaaa 
tcattaagcc 
gcagcaaaga 
gagataatac 
gaggaggtta 
cgtgccggtt 
taggctgaca 
tgcagaagcc 
cagccacccg 
ctgcctgggc 
ggccccgcag 
ggtgcagacg 
tagccccagc 
cctcccgggc 



atgacttgca 
ctaacatcaa 
catctgacct 
gaatcacttt 
cctttttctt 
tttactattt 
aagagaaatc 
tcagctactt 
tacaaatgtt 
acacactttt 
aaagcatcat 
tactatgaaa 
gcagcagtga 
tgtgcctact 
ttccaagatt 
caaagatggg 
tatcagaatt 
ctgaactatg 
agcttttatt 
aataccaatg 
agaggcaaaa 
tttgtataaa 
cgcctgctct 
tcttcacctg 
gaatgaggat 
ggaagacagc 
caagtcctat 
attacattgt 
ctatacagag 
tgcccttttc 
agggcttttc 
tcagaactgc 
aaagcaca tg 
atgagctgtt 
gagttctcac 
tgtaatccca 
cagcctgaac 
ggtggcgggc 
ccgggtgtcg 
gagcgagact 
tactcatagg 
tggcaataga 
gttatttttt 
tgggaatttt 
atatgagaca 
agaaatttct 
tgtttctgtg 
tatctgtatg 
aatgttttcc 
aagcctcatt 
gYctctagat 
gctagttcca 
ggcgctttag 
tgtgttaaaa 
atagtaagac 
ctggactcac 
gacccctggc 
aggcgccccg 
gtgttcggat 
cgggctgggc 
cctcgcccac 
cttcgcacct 
gagaggtttc 
tctcgctgcg 



gagtaggagg 
gttggggcac 
atagtaggga 
ttctttccaa 
tccgaatcag 
tactaaaatt 
catcatagcc 
gagttggtca 
caatagccct 
tagagaaaga 
caatagcaag 
ttccacggga 
actgggactc 
ggtcacagaa 
tatttagcat 
taaatcctcc 
atcatcatca 
tccttctacc 
atcactcagc 
caaggcaaaa 
agcagtgtct 
aatttctagg 
cctgactcag 
ttaaaataga 
tcacagaagg 
tttacggaag 
ctgtaaccct 
agggtggtgg 
tgaatgttag 
agctttggcc 
aaagataagt 
aactagtcta 
atgatctgcc 
gcccaaaccc 
tgagatgaac 
gcactctggg 
aacatggtga 
gcctgtaatc 
gaggttgcag 
ccgtcaaaaa 
gagccaagcc 
acttcgttga 
acagttcatt 
gtgatctctg 
acagagaaca 
tgccctcaag 
tttatttttc 
tctatgtatc 
tgtctccaga 
atgggttcat 
ttgggggtgt 
ccccaagtct 
taagttgaat 
aaagagaatt 
ctccttagcc 
cacccagcgt 
ggattttgac 
accttaccct 
gccagcaccg 
ccacgccagg 
gctatcccgg 
gccagctccg 
ctcttccggc 
tcacagcggc 



aaaaaagaca 
ttacattagc 
cttgaaacaa 
taattccact 
atcctaatct 
tacttctact 
tagtcttcat 
gcagtcaacc 
aaaggttgaa 
cagcacttgc 
aagactgaaa 
gttactggat 
caagagaagg 
agcaccctcc 
tcaggccact 
taaattacag 
aacatcaatc 
tttctatact 
agttaagtcc 
agaataacat 
gaaaaaaatt 
tattaaatac 
gggctggcct 
gatagcatct 
attaagtaca 
tggaaggact 
agcagctgtt 
ggattaaata 
tgccctatta 
tcagtcctat 
gcatagcaac 
ttaacctcaa 
tggcacaact 
attcaccaag 
aagaattacc 
aggccaaggc 
aaccccgcct 
ccagctactc 
tcagccaaga 
aaaaaaaaaa 
cgggcatgtt 
ttcttgtttt 
tatggatgac 
taaacgaatt 
cagtttttgt 
taaggagaac 
atttgtggct 
tgtaattcag 
tggtattaat 
tttccctctc 
aaaaattcca 
tgctgtccag 
taaagccaca 
atttatctct 
atgagtgttt 
gtaccaacgc 
ctcgcaagaa 
cgctggaatt 
ctacgaccag 
gctggggaga 
ggcgcccgca 
cagccgccgg 
acctgacccc 
gggtctctgg 



gcacatcaaa 
tttcctctaa 
tttttcacac 
ttaaaaagag 
ggcaaccata 
gtttgtgttt 
aattctcttc 
gcccctccca 
agggaacaca 
aactaaccat 
ggcccaaatg 
atgaatctaa 
gaaaactgct 
tcagacattc 
acagaagtca 
cagatgctca 
cagccatttt 
gatttttagc 
ccacattgtc 
tgaggaatat 
aatttctcaa 
taacaccaag 
cgatcaacta 
cttctacctc 
aagcaccatg 
atgaaatttg 
catttgttcc 
acactgcacg 
taagcaaggt 
atacaaagat 
acaatcacat 
aacaaaacag 
agtgcctttt 
cacaagactt 
tgaaatcggc 
gggcagatca 
gtactaaaaa 
tggaggctga 
tcatgccact 
aaaagtcacc 
gcttcttggc 
tctcatttta 
aagttatttc 
aatatctcat 
ccatttgtct 
aactctgata 
gaacttgtag 
aaagtccttt 
aaattagatg 
aaactctgta 
tgcagcttga 
aagctcagac 
aacatggctg 
tcccatttcc 
ccgctgcagg 
ccccagcacc 
atgcaaaagc 
cccgaagacc 
cagctgcagg 
ggtgcgggcg 
tcgggcgcca 
cgcgcgcccc 
caacgcgcgt 
gcgcgggcgg 



agatattaca 
aaagaaatgg 
ccatcttcca 
ctgcccaatc 
aaaatttcca 
caacacaaca 
acattaaaaa 
atttctcatt 
aaaatgaaca 
gaacagcctg 
atgctgtgct 
acaaatgaaa 
taaaagtatg 
taggaactcc 
tgcaagttag 
atttcttcag 
ctaggaggat 
tgatagatgt 
cctcttctgc 
cagtgctata 
tttattcata 
tagaaagaaa 
aaaactctaa 
catcataagg 
ccatgtaaga 
cagggagact 
tctgacaagg 
tggaaagcac 
ttttctgcaa 
cctagaattc 
tattcagaaa 
aatgtgctta 
acattttata 
acaatctaac 
cgggcgcggt 
cgaggtcagg 
tacaaaaact 
ggcaggagaa 
gcactccagc 
tgaaatcatc 
tcccacaggt 
gttttactct 
ctgttggtag 
agagatctat 
tttttaaggt 
cctggactgg 
aattgagggt 
atttctcatt 
gtaaaggctc 
gaacataacc 
gagccatcag 
agYccactgg 
ctccctaagg 
tccgctaaat 
aaacaccatt 
atctccctgc 
aactttgggt 
tcggctgcac 
aaccccattg 
tgcgcgccgg 
tcgcccgcca 
gccgaactcc 
tcccgcgcct 
cgctggggcg 
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54121 
54181 
54241 
54301 
54361 
54421 
54481 
54541 
54601 
54661 
54721 
54781 
54841 
54901 
54961 
55021 
55081 
55141 
55201 
55261 
55321 
55381 
55441 
55501 
55561 
55621 
55681 
55741 
55801 
55861 
55921 
55981 
56041 
56101 
561 61 
56221 
56281 
56341 
56401 
56461 
56521 
56581 
56641 
56701 
56761 
56821 
56881 
56941 
57001 
57061 
57121 
57181 
57241 
57301 
57361 
57421 
57481 
57541 
57601 
57661 
57721 
57781 
57841 
57901 



tttccaggca 
gggagccccc 
taggcaaacc 
tcctgtttcc 
acacaattga 
cataaaaatg 
catttaaaag 
gaacttcgtc 
atggctcatc 
cttgctttcc 
gcttggaaat 
taatgtgaag 
agactattga 
gaagagggac 
agcagtatgg 
tttggtgaaa 
gggtgagaaa 
caaagaagtg 
gcaaataatc 
tttcacacct 
ctattcctat 
gggtagcatc 
gcactatggg 
aagtaataaa 
tgatctcaaa 
gaaaagattg 
aagtaaacac 
acatgaatgg 
aagttcaatc 
ttttaaaagt 
agggatactg 
tatttttgta 
taaaaattag 
ctgaggcagg 
ctgtgaatag 
aaatagtgta 
cctaaatgcc 
gtgcagccct 
taagtatata 
ggaatggata 
gctagagtgc 
ttctcctgcc 
aatttttgta 
aaatcctgac 
agccaccaca 
gctggcctta 
gaagtcaagg 
ggttagaggc 
tttatcagag 
tgtctgttcc 
ccctatctta 
gggcccattg 
tttgtgcctg 
agaagagaag 
gcggtgtttg 
acctcagcct 
ttctttgtag 
aaaatgttta 
tctgtgtaaa 
aaagatacat 
cctgagtatg 
tgctgcaact 
aactgagcct 
cctgtgcatg 



gcgcgcgcct 
aggaagtcct 
cactgtgaaa 
gtgtttaacg 
aggttatttc 
ccaggagaat 
tagtttcgtt 
tcaagtctca 
gatgatgtgg 
ttccctggtt 
tacatgttca 
aagtgttaaa 
ggactgtctc 
tgggctcaac 
agttcagtgg 
ctgacctaac 
actgattaga 
cacagatgga 
tttgtcatag 
ttggtagcac 
aaccctcagt 
tgaatcagtg 
cagagagaag 
accataaaag 
atgggaggag 
gtagacttgg 
acaaacaata 
ggaatttcac 
tcactagtaa 
ataaaacatc 
ggaatataca 
gatttggtat 
tataagctgg 
aggattactt 
ccactgccct 
cagaactatt 
caacaattag 
tcatttatat 
tagtacRtgg 
cttttgttgt 
agtggtgcaa 
tccgcctccc 
ttttttttta 
cttgtgatct 
cccggcctgt 
aactcccaga 
acaagtttat 
acactctctt 
gcttggactg 
catacatctt 
gtgcacctga 
tatgagggtg 
agctgtctta 
tgatttcctt 
tccgagatga 
ccagactaac 
tatgcattcc 
aaatagttat 
agttctacat 
ataattttaa 
acacataaat 
aaaatcagac 
attctctctc 
catgacagtt 



cctgccggcg 
ggcttgaggc 
gtacaaaata 
gccctttaca 
ccccggtagc 
gctgtttgtg 
ttctctaagc 
tacctgagtg 
aattagacgc 
ttaatttttt 
gttagtccac 
gacaaaatta 
aattggcata 
tccgaataca 
atggaaaatt 
aggacttctg 
tatggaggaa 
cctagcagaa 
gcacaattct 
tatctatttc 
agatgaaatc 
gatggattat 
cagaactcta 
agctagcaga 
ctttctaaga 
ctacgtaaat 
tgaaaaggca 
cccaaaaaag 
acaaataaaa 
cagtgtggat 
ctttaataaa 
aatccacttt 
gggaagtggt 
gagcccagga 
ccagcctggg 
catagagtat 
gagttaaata 
tctgattgaa 
cttttttttt 
ttgtttgttt 
tcttggctca 
aagtagctgg 
gtagagatgg 
gcccacctcg 
aatttttttt 
tgtaagaatt 
tttagagaaa 
acggactaat 
cttctgtgtc 
tttgcagctg 
agggaaagga 
aagtttggca 
tctgtgtttt 
gaaatgcatg 
cggtgctcct 
tgggactact 
tggttatgta 
ctttaggtgg 
tgaagtgtat 
aggttttctt 
gctttttgtg 
ctctacatgc 
ctccatacct 
gatgacaaat 



cgcggggaac 
ccggtcttgc 
actttctgtt 
acagcaaaaa 
aattgtcttc 
tagtggttac 
gaaatatcct 
ctgtttgagg 
atagacttaa 
tcacttatat 
tactaagcta 
ttcaatgata 
gggaccactg 
gcgtgggcaa 
actaagagga 
aagacataac 
agggggttct 
gattcagaag 
ttctctgctt 
tgtagtgctt 
caaggaagat 
tcaataaatt 
tctcacacta 
agagggcgct 
agcaaaacag 
gctttttaca 
agttgaccaa 
aaatacaaat 
aaatcaaatt 
aaaagagcta 
gtaatcggca 
tagatttaag 
atgcgcctgt 
gttcaagagg 
caacatagtg 
tatttagagt 
acctatgata 
gtctttatga 
ttttNaacat 
ttttaaatgg 
ctgcaacctc 
gattacaggc 
ggtttcacca 
gcctcctgaa 
taagagacaa 
aaagaagaaa 
acaaacctga 
aagagttttt 
tgttgtgctt 
caggcatacc 
atgtacttac 
gttacccaag 
actgtctgct 
aggctagaaa 
ggtctgtcac 
acaggctcac 
catagaaaaa 
ttaaattaag 
catttttaaa 
ataaaagcat 
atgcaactcc 
tggccacatt 
ttgcatcgca 
gtgtcagcaa 



gcgcagcctg 
cccactttta 
gcggactaac 
agaggaatca 
gctgctgagg 
cgcactagtc 
cgaagacacg 
gacagtagtc 
cgttatttgg 
attttgatga 
acaatatata 
cttgttaaag 
caactgggtc 
gagggaattt 
agcatcagag 
atcacctggg 
ttgctaaaac 
ccgtactaga 
tcccactgcc 
attcactctc 
gaaagggaca 
gtgttcttaa 
attgtgaaaa 
cttttttttt 
caaaggaaac 
taaatattta 
aaaaaaaaaa 
taccagtaaa 
aacaaactaa 
agaaggaggc 
tcaaaaatgt 
tacataatta 
agtcccagat 
gcagtgctgt 
agaccccatc 
agcagaaaca 
gagctttata 
aaaggcaagt 
acatctttta 
agtcttgttc 
tgcctcccgg 
acacaccacc 
cgttggccag 
gtgctgggat 
gatctcacta 
gaaacaggaa 
aaggggcttc 
aaggatttag 
atctgggagg 
cccgagtctg 
taaggcccac 
agaatttccc 
ctttctggct 
gggagctgga 
cagactcaag 
accactgtgc 
aaggttgaaa 
aatgatttcc 
taagaaatta 
tcaagatata 
tgctcacctc 
gaattaattt 
gttccccagg 
tcagaacagg 



cggtgctccc 
tctcaccaag 
gtcgccaggc 
gttgctaaaa 
cttatgttag 
ataagcacta 
gccttggtct 
catgtgacag 
ggaggattgt 
gcaacttata 
gctgtgacca 
caccgaacga 
ttgcagtggg 
atagccaacc 
gtaaggggca 
ggatggtgca 
tggattttta 
gtttggccaa 
tggtccaaat 
ccacatgccc 
attatgataa 
ctgattagct 
caaaatccca 
tttaataaat 
aaccataaag 
aattttatta 
ttagcaaagg 
tacatgaaaa 
gatgtaaaca 
attcttaggt 
ttaaaaatct 
aatacacata 
actcaggggg 
atgattgcac 
tcttaaaaaa 
tctaaaacaa 
agaggatcct 
tatgatgcaa 
ttgactcaga 
tgtcccccag 
gttcaagcaa 
atgcccggct 
gctggtttgg 
tacaggcgtg 
tgttacccag 
aggtggctta 
tggccagtta 
ggtgggagag 
gagagttgtg 
cttttagctt 
tgttttactg 
cccacctccc 
gcttatagtt 
acttaaagtg 
caatcctgcc 
ctgccttgtt 
aatatatgcc 
tttgcacatt 
tgattttgta 
agtccacctt 
cctaggttct 
caatttcctg 
cctgcagttg 
gcctcattac 
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57961 
58021 
58081 
58141 
58201 
58261 
58321 
58381 
58441 
58501 
58561 
58621 
58681 
58741 
58801 
58861 
58921 
58981 
59041 
59101 
59161 
59221 
59281 
59341 
59401 
59461 
59521 
59581 
59641 
59701 
59761 
59821 
59881 
59941 
60001 
60061 
60121 
60181 
60241 
60301 
60361 
60421 
60481 
60541 
60601 
60661 
60721 
60781 
60841 
60901 
60961 
61021 
61081 
61141 
61201 
61261 
61321 
61381 
61441 
61501 
61561 
61621 
61681 
61741 



cttggcttct 
tcatgtcaga 
gttcattttt 
tatgcatcat 
cccagtactt 
ccaggcaaca 
accagaaaaa 
ggctgaggct 
gccactgtac 
gttaaaattt 
accctagctt 
ggaaaaggca 
caccttcatc 
tgttcattaa 
gctaggaaac 
tactggaaat 
tgaattagcg 
taaggaacaa 
tataaagaga 
ataataggcc 
ggtggatcac 
tctacacaaa 
tctggaggct 
gatcgtgcca 
taaataataa 
tgtggatgga 
gtgtgcattt 
aaaatgctta 
tgcaactgtg 
ttgctacagt 
ctgtaaaggg 
aataatataa 
gtaatactgg 
atggaaaagc 
gactctagca 
attaaatagc 
agtggttctc 
ctttatgttg 
ggatgctcta 
ctgaaaagtc 
gcaggtgggt 
taggctaagt 
aagatccgtg 
tgaaacagat 
gccagattat 
ctgtggaggc 
atgaatgtct 
aaaacttagt 
aatttttttt 
ctattctagg 
ttggcttatt 
tccgtaaggt 
acagtgaaaa 
tccacgttct 
ttggtgtgaa 
ttccccaggg 
cagcctgaga 
ccaaactcat 
ttatcacttt 
taatttttct 
aggcgggcgg 
gtctctacta 
tcccagctac 
agtgagccga 



aagtacagta 
gttccattgg 
cctatgtatt 
tagaggctta 
tagcagacca 
tagtgagatc 
aaaattagct 
ggagaattgt 
tccagcctgg 
agaagaggaa 
tttaaataaa 
ggattgaacc 
ctgtattttt 
gttaaattca 
tcattcattt 
atggaaacag 
tgatgtacaa 
ttgtgttaaa 
tgtttaaaat 
gggcacagtg 
ctgaggtcag 
tacaaaaaac 
gaggtaggag 
ctgcattcca 
aataaggttt 
agtgtaggca 
ttgtaaaaca 
tttaYagtta 
agtattaaag 
cataaaaacc 
tatgcgaggg 
agtttcaaag 
gttcctgctc 
atatgagaaa 
attcataaaa 
aggaaagaaa 
aattgggggc 
gcataaatgg 
aacatcctac 
aagagtgcag 
tattcttgaa 
tgagagtctg 
gccgaaagag 
tatttcactc 
agtaaatctt 
acagaagacc 
tgaagtggct 
attcttatgg 
ctcaaaaata 
cccagctcat 
catctttgca 
acactctcaa 
caaattacag 
taatgaggct 
gcagaagtgg 
actgtgtctt 
aagaggatat 
ttcttaccag 
tcacacttaa 
tggccgagca 
agcacgaggt 
aaagtacaaa 
tccggaggct 
gatcgcacca 



ttgtgacagc 
agagctgaga 
gaaaggaaat 
aagtctgttt 
aggcaggagg 
cccattctta 
gggcatggtg 
tcgagcctag 
gtgacagagc 
ggaagggcag 
acctaaaata 
tctgtgaaat 
taattcttat 
gtaaatgctt 
ggttgataaa 
aatatgtttt 
ataattacag 
gattgggagc 
aagattttgt 
gctgacgcct 
gagttcaaga 
ttacccaggt 
aatcactaga 
gcctgggtaa 
gagtactgtg 
tttcatacaa 
gcagaacatg 
ttcatcccgg 
Ktatttaaaa 
aaagcattta 
ttcactatac 
aaagtactca 
ctaaaaccat 
gtgtccaaat 
tttatatgag 
ttgctttggg 
aattttgccc 
attgggggga 
agtgcacagg 
ccattgagaa 
gtctacagct 
gtctggggaa 
gctgaagaac 
agaaaattat 
gaaaaccagg 
tattaggaag 
tctctacttc 
agtggactaa 
tattttagtc 
cttaatcttg 
aattgagcac 
gtgagggtgg 
agtgatttct 
ccagtaagtc 
tgcttcacag 
ccaccttccc 
gagagttttt 
actatcctca 
actctcaata 
cggtggctca 
cgggagatcg 
aaaaaaaaaa 
gaggcaggag 
ctgcactcca 



ttgggaggat 
agcaaacaag 
gcagaaactg 
aggccaggcg 
atctctggag 
cacacacaca 
gggcacatct 
gagtttgagt 
aagaccctgt 
gagggagaga 
ccttaaatgg 
ttgcccattc 
gctcttttgt 
gtttagtagc 
tatgtgccag 
ctgcctgcaa 
cacagtatgc 
atttaacttt 
tcactagtat 
gtaattctag 
ccagcctggc 
gtggtggcgg 
accctggggg 
cagagtgaga 
tggacaatca 
ggaagaaata 
tttagggaat 
cactaagaac 
attgctaaag 
caagattgac 
tgtctacttt 
ccaaaattgt 
ataatccagt 
catgaaagga 
aagtgtacag 
aaaggtaaag 
cccagcaaca 
tgctaccggc 
tcagcatccc 
atcctgagct 
tcaagagctt 
agggtccaca 
caggaaaacc 
caggaggtga 
cagagatagt 
ataatggcag 
aactgcccaa 
actatatcca 
tttctttccc 
gacaactgca 
tggatatact 
gaggagaagg 
cattagtccc 
tgtagactgt 
taagtgttgg 
cgcagccaca 
tgaagtctac 
ggctaaattt 
agtcccaact 
cgcctgtaat 
agaccatcct 
aaatgccggg 
aatggcgtga 
gcctgggcga 



gattggctgg 
cttagcagct 
aaagattccc 
cagtggctca 
accaggagtt 
tacacacaca 
gtagtcctag 
ttgcagcgag 
atctaataaa 
aggaggaagg 
agctgggagt 
attctttccc 
gggtagtaat 
tattatatgg 
gctaaaWgtt 
gtagcccatg 
taagtgcaat 
cagaattcag 
agtggcaagt 
cactttggga 
caacatagca 
gcacctgtaa 
cagaggttgc 
ttctgtctca 
ccaagggagc 
attataggtg 
ttcaaagagt 
agtttagata 
gtgtttttct 
tgagtggctc 
tctatatctt 
gtgtttaaaa 

tggcgatgca 
agaaaggaga 
cttaaaaagg 
aatgaggaaa 
tttgggacta 
atctcagggg 
acagcaaaga 
agatgcacag 
cctaaaMccc 
gcttttatct 
aaggggtcag 
ggctgggtct 
tggacatcaa 
gacaaaaact 
gttcccttca 
atcagtcacc 
cgcttatcag 
aaaccttcct 
tcctggcatg 
ttgtagaatc 
attatgggga 
ttggggctga 
gataaatcca 
ggaaggtaat 
aaacctgtta 
ccttaaatac 
ctaaatctaa 
cccagcactt 
ggctaacatg 
cgtggtgcgg 
acccgggagg 
cagaacgaga 



ggttccatgc 
ttgcagatta 
aaatccaaga 
tacctgtaat 
caagatcagc 
cacacacaca 
atacacaggt 
ccatgattat 
acaaaagtct 
gaaaaaactt 
ggggatccag 
actctaatct 
ttacattctt 
aaattattat 
aggtgctagg 
gactggtggg 
aacagaagtg 
gaaaggcttt 
aaaaaaaata 
ggccaaggcg 
aaaccccgtc 
tcccagctac 
agtgagctga 
aataaataaa 
aaagtggaga 
tttttatatg 
gtggtataga 
tttaataaat 
taaaaaatag 
actgctgaag 
tgaaactttt 
tatgagatct 
acagaagcaa 
ggtccctatg 
tagggtaagg 
gagctagacc 
tctggagata 
taagggacag 
ataatctagc 
actaYatttg 
ttttcaaatg 
gattcttaaa 
ggaccttggc 
gtttgctgga 
cctgaggtct 
gtgtcacaat 
atcaggctgc 
aatccatatt 
ggtcaacacc 
tagtgagtcc 
tggcagccat 
cccccaagtt 
acatgcctca 
gggtggggca 
acaagctgag 
cacggataat 
tttgccgatt 
caagtttcga 
attgtgcagc 
tgagaggccg 
gtgaaacccc 
acgcctgtag 
cggagctggc 
ctccgtctca 
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61801 
61861 
61921 
61981 
62041 
62101 
62161 
62221 
62281 
62341 
62401 
62461 
62521 
62581 
62641 
62701 
62761 
62821 
62881 
62941 
63001 
63061 
63121 
63181 
63241 
63301 
63361 
63421 
63481 
63541 
63601 
63661 
63721 
63781 
63841 
63901 
63961 
64021 
64081 
64141 
64201 
64261 
64321 
64381 
64441 
64501 
64561 
64621 
64681 
64741 
64801 
64861 
64921 
64981 
65041 
65101 
65161 
65221 
65281 
65341 
65401 
65461 
65521 
65581 



aataaataaa 
gtcaataaat 
atccaacaat 
ggactgtatt 
cctctcctca 
cggggcgggg 
tgacgcgtgc 
ggtaggaaca 
ttcattcgat 
caagaggcaa 
tctctcggga 
aaggtgcggg 
ggtgaggccg 
attttcttgt 
gagctttttt 
ctgggttttt 
tgcgggagcg 
gcccagagta 
gacatcttta 
ttttcttttc 
gtgcagtggc 
tgtctgagcc 
atttttagta 
aagcgatcct 
ccggcctaca 
ctagtaagta 
gtggaaagta 
gttctgagaa 
caatgtaaaa 
ttatcaagtg 
tggtaggttg 
ttagtactgc 
ggactgacat 
atgggtgtac 
tagttatgtt 
aacatgctga 
gtaagctaat 
aatctacttg 
ggaccagcct 
ggctgccagc 
aatgatgctg 
tgtaactaga 
tgataattgg 
aagcaccatc 
aggagacatt 
ttttcagctt 
gaatcttctt 
ccctctgaaa 
tatgcagtac 
ttgaaggatg 
acaacaatgc 
aatagagcta 
aattagcaaa 
atatgaagga 
aacagcctga 
tcaggttaaa 
ctggtgtgag 
tagtattagc 
taggagactt 
tcattgaaac 
aggtcaagac 
taaggaattg 
tttacaaatt 
cctcttcctc 



taaataaata 
aaacacgaga 
ctctgaagcc 
tcaaagcgat 
ggtgtagccc 
agttctctcc 
cagagctgcc 
tggcggatcg 
tagcctctgt 
aacttgRaag 
ggcgccaccg 
agggagcaga 
ccaggacttg 
cagtggtgct 
ctttcctcat 
gtaatgaaat 
atgctgaggt 
tccttccatc 
tgataggaaa 
tttttttttt 
gcgatctctg 
aaagtagctg 
gagacggggt 
cccgcctcgg 
agtccttttt 
gctccccagc 
gaagcaaagc 
atgcatggtt 
taacgataaa 
tcatggactg 
atttacagca 
tacgatgtca 
aatatgtgca 
ttattaccat 
tctcattgtc 
taggaaaacc 
aacttggcat 
tgataaaact 
ctaagtgttg 
cagccttcct 
ctggatagaa 
aggataataa 
attttggaga 
atgtttacaa 
taagtgatgg 
caggttctga 
agcatttatc 
accgtgagtg 
tttagttatt 
gaaaagatag 
tataatagga 
ggatcaggat 
acctgggcaa 
gtagtggaaa 
gttggagatt 
ataaataacc 
ataattggtt 
agaatatgaa 
tcaatacaaa 
aatggcagag 
taggtgagag 
ctattcccaa 
gagtcttgaa 
actatttctt 



aataaataaa 
gggtacgaga 
tttaacacct 
tggaccccag 
gctctagaaa 
gaccccaccc 
cagtcgcgga 
gctcacgcag 
ctttctcttt 
tggggagcgg 
cgaactcgga 
ctctttcgct 
tgggctgcgc 
tacgggttct 
caattttaat 
gtcagccgag 
gaggtgagag 
tcttgcctct 
tcttctaatc 
tttttttact 
ctcactgcat 
ggattacagg 
ttcaccatgt 
cctcccatag 
aatgtctccc 
tttgttcctt 
cagaggatgc 
ttggcgataa 
acagtaatac 
cacataattg 
gcatcaccac 
ctaggcaata 
tcccgttgac 
tacgagttag 
tgattgtgat 
taagtacctg 
ttttcttaat 
gatctaccct 
aattcaccaa 
cagggaacta 
atcttatttt 
ggacagagtg 
aaagccttta 
agagatactc 
aaaaacagta 
tgtaaattgg 
ctaatcagaa 
cagaagctat 
ttccataaat 
acactgtgtt 
gaatgtatcc 
ctaggtttgt 
agccatttag 
acgagtcagt 
tatcttgaag 
tggtgaactg 
gaactaagca 
gatcaaagga 
ttaaatatta 
aaacccagtg 
aaattggtag 
tctttaaaag 
caaatacatt 
tacacttgta 



taaaaataaa 
aattacttaa 
aacaacccca 
accattcccc 
ggccgcctcg 
gttccaagga 
agagcagctg 
cttcaggacg 
cttgaggtaa 
actgagagac 
ggtttgaggc 
tgaggagacc 
ggtcctgtcg 
ctttcttttg 
ccaccaaggg 
agtattcctt 
attttctttc 
tggccatccc 
ctcgtatgtc 
ttggagacag 
cctccgcctc 
cgccctccac 
tgtccaggct 
tgctgggatt 
ccaaattcta 
ctggttattt 
gcagtcatgg 
cttttttttt 
ataaaccagt 
tatgtgctat 
aagcactaag 
agaatttttc 
caagacgtca 
attgttttct 
ttaatgtcct 
gtttccttgc 
acatcagaat 
gggtgaataa 
gataaattct 
ctcctttgag 
agggaaaatt 
aatatggcct 
ggataactag 
ctaaaggagg 
tgtggttagg 
aatatggagt 
ttatttgtat 
gtcgcattca 
ttttgttgac 
gaatattttt 
aatttaaaaa 
atagcatctg 
tcataagtga 
agagttgact 
acagtggtgg 
gaaaacaaca 
acagtgatag 
aagtaagaat 
aaaagaggaa 
gaaatactta 
ggaagatagg 
tacttaagga 
tctgcttact 
actactccag 



tatatttact 
aaaaaaaaaa 
aggggagtat 
cagttactca 
ggagtaccac 
cgcagccctg 
ttttggcgtc 
ctgtgaattc 
agcaagggag 
agggcttcgg 
cggctgccgg 
gggaagatcg 
tgttccctga 
ggggttggga 
agatctcatg 
caccagcgct 
atcttagctc 
tcagaataca 
tttccgctta 
tgcctttcgc 
ccgggttcaa 
cacgcccggc 
ggtctcaaac 
acaggtgtga 
ctctctatta 
ttcctccaca 
gcccctaatg 
tgttttaata 
aacaaagtca 
gctttcatac 
gaacgcattg 
agctttatta 
gtgggtattc 
gctaaatagt 
agataagggg 
cagttttttt 
ttgtacactt 
cactagtttt 
accctgcagg 
gttttacgaa 
ttgcatattc 
taactccagg 
tatgagtgcc 
ttaaaaaatg 
gagttataat 
gatgtgaacc 
aacttttaac 
ctatgtctct 
taaatggatc 
ttgtgtgtgt 
tgaaaaatgc 
agcacttatt 
ctttatgact 
tagattattg 
agtaagctta 
agaagactat 
aaggaaaaga 
ctgaagttag 
acagttgcct 
ttagaaatag 
tgtatttaat 
atacttttag 
taaggtgctt 
tctcagaaaa 



cttcaatttg 
tccccgcaga 
ttcggctctt 
aaagatccag 
ccactcctga 
tgattggcca 
tgtttgctgc 
ggtgaggaat 
gaaaggggcc 
cataaagcgc 
gcggtgcttg 
ttcttgtccg 
ggacagtttt 
ggagaagcag 
ctctggatga 
ttcaggttgg 
gttcctttgg 
tcagtttggg 
tttgagtctt 
ccaggctgga 
gcgattctcc 
taatgtttgt 
tcccggcctc 
gccaccgtgc 
aaggccaagt 
ctatgcctga 
aaggggtcat 
agtagaagta 
tttagtatca 
actggcagtg 
cgctacagcg 
taaccatagg 
ccatgcggca 
tttccattga 
ctagagagaa 
tttcctcact 
gaagagtgtg 
taagctgaga 
gacttgcctg 
atcaaattgt 
atacatgtct 
ttaggcataa 
tactgtgtgt 
tcaaatactc 
gaattgagta 
aaaatagggg 
atttgttcta 
gtagtgttta 
ttaaaagctt 
ttcctctatt 
tgcttggatt 
gctccattac 
agttagataa 
aaagcttcaa 
aatgttttga 
tgtaataatt 
gaaagatttg 
attggttgac 
ttgagttctg 
gaggtttgag 
ttggagatgg 
agtacaagag 
ttctgagcag 
ggaatcatat 
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65641 
65701 
65761 
65821 
65881 
65941 
66001 
66061 
66121 
66181 
66241 
66301 
66361 
66421 
66481 
66541 
66601 
66661 
66721 
66781 
66841 
66901 
66961 
67021 
67081 
67141 
67201 
67261 
67321 
67381 
67441 
67501 
67561 
67621 
67681 
67741 
67801 
67861 
67921 
67981 
68041 
68101 
68161 
68221 
68281 
68341 
68401 
68461 
68521 
68581 
68641 
68701 
68761 
68821 
68881 
68941 
69001 
69061 
69121 
69181 
69241 
69301 
69361 
69421 



actttgtgca 
atttgctctt 
ggccaatatt 
ctgtttcaac 
ctttgcagaa 
cactcctttc 
gccattacca 
tggcctaagc 
cctgcctctt 
gaaggtaaac 
taaaattaga 
aatattacaa 
ttctggttca 
gctgagtgag 
ccactatatt 
aaatgtgtaa 
cggagtctcg 
tttcgcctcc 
ggctcatgcc 
gttggtcagg 
tgctgggatt 
aaaagctata 
tttctaatat 
tgattgcacg 
ctacagctgc 
tttgaattaa 
aacatacctg 
ctgggaagtc 
aaacatggtg 
gaaggcagaa 
tactaattcc 
ccctcttaat 
attcaaatca 
ttttcctcaa 
tggagtagaa 
gatctctaat 
gtctgatttt 
atttaaaaat 
tgtgtcttat 
catgtctgga 
ttgctgatat 
cagactcata 
agaattctgc 
atgacacatK 
ataaaatcat 
ttattaaggc 
tatgacataa 
tctctcaagc 
agtgtcctgc 
gaaattgtgt 
gcaaatgttt 
ccaagagatc 
gaactatttt 
acaatgaagt 
cttttgccct 
ttccactaca 
ttgtgtcagc 
gataatctcc 
tggggctgta 
gtttaagttt 
tcaagctttt 
agctgtaaaa 
tattttaatt 
tcttcattat 



tgagaaataa 
ttccatcact 
ctcttatcat 
tagatttata 
ttaaaacgtt 
tgttttattc 
ataaagtcct 
ttgcagatca 
tcaataatat 
aggttttctt 
tttcttgaga 
gttttgtttg 
tgtcagataa 
ggaccttaac 
gtcataataa 
caggattggt 
ctctgtcacc 
cgggttcaag 
accacgtctg 
ctggtctcaa 
acaggcgtga 
tgaagtttct 
cctaatctag 
aacagcaaaa 
tttacaggta 
aggaggtaga 
agacagtaat 
caagatcaag 
ccatgtttag 
gggcaaaaga 
atctatgaga 
accatcacat 
tagcaccttt 
ttttgtctct 
taatggtgtc 
tgaaaagtca 
tagaatttct 
aagtacatct 
aaaggctgct 
ggatgttgtt 
tgcacagtca 
gcatcagtgg 
atcagactta 
acctttttag 
gattgaatca 
atgtaataca 
tttatgtctc 
caaaggagaa 
ctcttagagg 
tgcttttaag 
cttctgctag 
ctctgtcaag 
tcctatttct 
atgttttttg 
tattaacctc 
tcagttactt 
attttcaact 
actggaaagg 
tttttcagaa 
caggagaaaa 
ccacagttct 
gttgcagcaa 
gatgtttatt 
agttttgggg 



atttttatta 
atccccaaaa 
tttactatca 
tcaaatttaa 
taatcaacat 
ccttgctcag 
taaaactgat 
gttttgtaat 
tcagacagca 
agcttctctt 
aattcagctt 
ttgtaaaaaa 
tagattgtta 
aaggaaggca 
atgaaattac 
gtatatttta 
taggctggag 
gaattctcct 
gctaattttt 
actcctgacc 
gccaccacgc 
ggtttcatta 
cagtatcttt 
gacattgatg 
agcctctatt 
tttagtacct 
ttataaagaa 
gcatttatta 
ggccatgttg 
gcaagagctc 
tctctgcctt 
tggggattag 
tgttataata 
tttcctagtg 
taaacgtaat 
ttctttttaa 
ttactgaaaa 
gtgacagctt 
agcttgtata 
tatcgaggag 
cagctgaaga 
ataccatgtg 
gatacaagcc 
ctatttttaa 
gctttaaagc 
ttaatgaaca 
cattttgttg 
gacagtaaga 
tggcattgtg 
aaatagagtt 
aacctcatac 
agaatttcct 
tttcaccttg 
aatcatcaat 
tcaactttta 
ggcatcattt 
atggttattc 
tggaattgaa 
tatgctgagc 
caggaacaat 
tgaaaagtac 
tttattggct 
aaaaacactt 
ggagagaaga 



aacatggtgc 
agtctttact 
caagaaaaga 
ctttacctag 
taataataac 
agattatcat 
aagtcctttc 
gccattggag 
attaacaaag 
agtttgactc 
attttatttc 
tttacagtca 
ctcctttaga 
tcccatttat 
atacattatg 
atcattagaa 
tgcagtgatg 
gcctcagcct 
gtatttttag 
tcgtgatccg 
ctggccgaat 
acttaaaatt 
ctctagagta 
ttttgataga 
cctttgagaa 
tttgtctgtg 
cagaaattta 
ctcactgtgc 
agggccttct 
cctcttcaac 
catgacttaa 
attttaatgt 
ttcaaatttc 
aagaatgaga 
gtgtaaggga 
taattcacat 
acatatcaca 
ctctttctaa 
agctagaaga 
acatgcttct 
caagaagtgg 
gctgagaaaa 
ttaccaacaa 
tagtcttcta 
atcataccat 
taatataagg 
tattggccag 
acagacataa 
taaatctgag 
agtgtggctg 
ctgttctagt 
ggctgttgtg 
ctttttttca 
tctttctcat 
ttgctgtatt 
cacctcagtt 
atccaaccct 
atgtggtcca 
tacatttgct 
agaacactct 
tatgtttcaa 
agtcatagaa 
tgctatcaga 
ttcagtcaga 



catagatatt 
tgtggcttac 
aagattattt 
attgatacca 
tggcttattt 
tacagaaact 
taaccttgat 
tattgcagca 
accagccagc 
tcacattttt 
taaaattatt 
attcacctaa 
ggagaatttg 
aaatttcttt 
cttaattccc 
tcttcttttt 
caatcttggc 
cctgagtagc 
tagagatggg 
cctgccttgg 
cattagaatc 
tgtttctgaa 
tgcccagctt 
ttccttaccc 
ttttaccagt 
tccatttgtg 
ttttctcaca 
atgagtaatg 
tcctgtttcc 
ctcaagccat 
tatcctctta 
atgaattttg 
cattataagg 
ggagggtggc 
gtcacttaaa 
ggggaaaatt 
atgattgcaa 
actctaaaga 
agaaaaccat 
ggagaagata 
tacccatagc 
gaactgtttg 
ttacagaaac 
ttttcactct 
cattttttaa 
aaacatatgt 
tacttttaca 
gggacatttt 
ctttgagcaa 
gataagaaag 
tcatctccac 
aaagaatttt 
tccttaattt 
tctccattta 
ctagtcagac 
tattattgcc 
tggatctctt 
cattttaact 
gcaatctgtt 
gcctgttatt 
atttcaggaa 
aatttttgaa 
tatttggcat 
aaacttattc 



tacaagtcaa 
aataagtatt 
tgtatttcaa 
aacttagttt 
tatcaacaat 
tattgttgtt 
atgttttctt 
atgtggtcct 
taaccctaca 
tgtaatcctt 
gttgtttgag 
catctcataa 
tattaattta 
attcaggtga 
aatttgagct 
ttttttgaga 
tcactgcaac 
tgggattaca 
gtttcaccat 
cctcccaaag 
ttagggcaga 
aacttgaggt 
tttgcagcac 
agtgaagaat 
aatagagaat 
ctgctataac 
gttctggagt 
aatttttatt 
tcacatggct 
attttaaggg 
aaggcccctt 
gaggggatgc 
aaacctctta 
attctacaat 
gatatctaga 
cttaaaagtt 
agtcatccag 
ttttaaaatt 
gaagctgcta 
caaagcgcac 
cagtctcttc 
agtgccatta 
attaaacact 
tgataagctt 
ctgagtgaaa 
aaaattctgt 
aatcaaaaca 
agtttRggct 
gaaagttttg 
tcacatttat 
gtttatttta 
tctacatcct 
gcttgtcatc 
tctgaaggtt 
cagaacatac 
aaataataat 
tgagYctact 
agctatttcc 
gctattcaga 
tttgttgtaa 
caccagcgtt 
cttttaactg 
aaatctgtac 
aaagtaccta 
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69481 
69541 
69601 
69661 
69721 
69781 
69841 
69901 
69961 
70021 
70081 
70141 
70201 
70261 
70321 
70381 
70441 
70501 
70561 
70621 
70681 
70741 
70801 
70861 
70921 
70981 
71041 
71101 
71161 
71221 
71281 
71341 
71401 
71461 
71521 
71581 
71641 
71701 
71761 
71821 
71881 
71941 
72001 
72061 
72121 
72181 
72241 
72301 
72361 
72421 
72481 
72541 
72601 
72661 
72721 
72781 
72841 
72901 
72961 
73021 
73081 
73141 
73201 
73261 



agtattataa 
ggaaaactaa 
ttaaaaagaa 
aatgtatagg 
gaaagaaaaa 
tgttgttctt 
tctgtgatca 
tcttcagtaa 
tttttttaga 
gatattaata 
cacagatttt 
aatactaaaa 
taaattttat 
taaatctaaa 
agacctacag 
cagttctagg 
attaccatct 
aatctgtcca 
agccaatttc 
gatcttggct 
ccgaatagct 
gagacggggt 
cccgcctcag 
gccaatttca 
gcctttcaag 
tcctctggat 
accccaaaat 
atttcacttt 
gactacaagt 
tattgctttt 
tgtgtctctg 
tgtagccatg 
gaattagcat 
gaggataaag 
gtagtgccag 
gctgtgtgga 
atgaccttac 
ttccctggtc 
taggttgaag 
attttctgga 
gtgcctaagt 
gggggctggg 
cttcaggttg 
atagatatcc 
gaggacatgc 
tccctgtctc 
ccctgtgctt 
aaatggctta 
aagccagact 
tgtacccata 
ccttccccgt 
gaatagtccc 
ctSagctgtc 
tttattcctc 
ttgcagatct 
atagttcctc 
ataagctgat 
tcaaagatca 
catggtagaa 
aggcctttac 
ttaggatttc 
actattttaa 
tttacttgac 
caggggtaag 



aggagtcaaa 
caggatttat 
gggaagttat 
gtaaatataa 
aattaacgtt 
gtttctgttt 
tggctcactg 
ctgggactac 
gatggggtct 
ctatacagta 
tccctaaaaa 
aagattcagg 
tttatttcat 
aactggataa 
tttgcttttt 
ggcaccacta 
tgctctgctg 
cttgtgcctt 
tttttttgag 
caccgcaacc 
gggattacag 
ttctccatgt 
cctcccaaag 
atgggagtat 
aatggtacct 
ccttccagag 
ttctatgtcc 
gatttaatat 
gataagtaac 
taatccagtc 
ctaccacttc 
aaaaaaaaaa 
ttacacaaaa 
aaaacactaa 
ttaagtcoag 
tatgactgtt 
tgcctggtac 
cctctttctt 
ccatattctc 
agcctatggg 
tactatcgta 
caaggtaacc 
gcttgatgat 
ttatacttgt 
taaatgtaga 
ctgcttatgt 
ctggaaaaca 
ttgcaaagaa 
cagaccctcc 
gactaatctg 
aagccactgg 
tccttttccc 
ccattaacgt 
cctgtaagag 
tacctttgaa 
tctccctgtt 
agcttataga 
aagtgctggc 
gaagcaaggg 
cttcttgacc 
aacgtacaaa 
taatcatttc 
ctcattcact 
gttggggcaa 



aaggtacaaa 
cactagtaaa 
agcagaagga 
tagacttctc 
gttgtatgtg 
ttttaaggtg 
cagcttcacc 
aggcatgtac 
cactctgttg 
gactataaga 
actacagatt 
taacccagga 
taatacagca 
gaagctgtgg 
tcatatgtta 
tcagttaacc 
ccagttatgg 
aactacatcc 
atggagtttc 
tcagcttctc 
gcaggcacca 
tggtcaggct 
tgctgggatt 
cctttactat 
cctgccgatg 
aagataggat 
ttgataccat 
aagtaaaccc 
ttttaccata 
agatcagaga 
tggtagcaaa 
tgctactaca 
tatgggagga 
ccacatcacc 
gtcagccaag 
ggattagggg 
ccagctttga 
cttttaccca 
tctattcttg 
aaaggaggaa 
actggtccag 
agtgaggtgg 
gcattgtgat 
aattccccta 
gaggtagtag 
ggtacatggc 
gtgacagtta 
ccactagaac 
aaatcccatt 
gacaaaatgc 
actttgatcc 
ctcctgaaaa 
cacctgctcc 
aaaaagagaa 
gcattctcct 
tgtattcacc 
caatataaat 
aggtctggtg 
aaactcttgg 
tagtcatttt 
tttgtggggt 
aaatagagtc 
tgggtaggaa 
aaagagcaga 



gagaaaaggt 
cctgctctaa 
aacttagaat 
ttgaggtttt 
attctctgta 
aagtctctgt 
ctgggttcag 
taccacgtcc 
cccaggctaa 
aattaagtat 
attttcaaaa 
acgagaaaca 
ataattaaat 
cttgtaatta 
agaaaagttt 
ctaaagattt 
taaccaaact 
taactcccgt 
actcttgtcc 
gggcttaagt 
ccatgcctgg 
ggtctcgaac 
acaggcatga 
cattcctgac 
atttttttta 
gtgagtgagt 
tccctgtata 
tgtgtctggg 
gactgtcaat 
acaaattcca 
acatgtcagt 
aacattacag 
gttagggaag 
ctaaagccct 
gcgtgcgtac 
gttctcaaag 
gaggaccaaa 
tatctaagtt 
aaggaccaca 
tgaaaagctt 
taaggagaag 
ggagataaat 
catggcttct 
tacacaaatg 
tatgacatgg 
cactgactac 
agaaatccct 
tgtgagtaac 
ctttgcctca 
cctttaactt 
accttcagcc 
ttgactagcc 
tctcctccac 
aagcctattt 
tattgcaata 
cattcaggct 
ttacttccca 
tctggtgggg 
gcctctttag 
tcaaaggctc 
tgggagggat 
attctatacc 
aaaaatctca 
ggcactcatt 



caagacattt 
aagaattcaa 
ggcaggaata 
aaaaattaca 
gaggatatac 
cacccaagct 
gtgatcctcc 
agctaatttt 
tgccaaactg 
ttgtatcatc 
gcattaataa 
gaataaaaca 
gtcaatgatc 
aaaatagacc 
aatatcctgc 
ctgtgggtga 
ctctttgttt 
catcttcatt 
aggctggagt 
gattctcctg 
ctaattttgt 
tcccaacctt 
gccactgcac 
ctgtgatatt 
aagccttaat 
aagttatgta 
ccaaggaagg 
cattagttaa 
taccttggca 
gatttccatc 
gtgaggtaga 
gaataaggaa 
tgaaagaatg 
tattgctaaa 
cccttggccc 
ccaaaatgga 
aacgacggat 
gtttaaagga 
gactctacgc 
cgtgatgtct 
gggtatctga 
agaacttggt 
acttcagatg 
ccagcaagaa 
aaaacactcc 
agtattaacc 
ccttcttgtt 
accttcattt 
aaataattag 
gacccaattt 
tgagccagca 
accaggaagg 
gttagttttt 
ctatctgatc 
gtctccctga 
gctatatata 
cagttctgaa 
gcaccttctc 
aggtactaat 
gacccatcat 
acaagcattc 
tagtctggat 
tggactctca 
tactgtgagg 



ttcaaatgag 
gggaagcttt 
aagaaggcat 
tttgttattt 
agtttttttt 
gaagtgcagt 
cacttcagcc 
ttttttcttt 
ctaacctcaa 
cctagagcga 
attaaggtgg 
atagtaaacc 
taagctgatt 
tttattcact 
ttctctattt 
aaccattttg 
ctggatgcta 
gaatttaagg 
gcaatggcac 
cctcagcctc 
atttttagta 
aggtgatcca 
ccggccagga 
ggccattaga 
gaagagagtg 
taataatttc 
gtattctggc 
caaaccaagt 
atggaaacat 
cttaagattc 
gtcaatggaa 
tttgatcgaa 
aaaggagaaa 
tgaggttatt 
cacttggaaa 
tttatcagca 
gtgtcttgca 
gaaaaatcct 
ccctgttgaa 
ttgaatacag 
atttcaactt 
agctaagacc 
tgttgctgga 
tagtcccatt 
ccaacttgtc 
tttacttgag 
acatttcagg 
acctatgata 
ttgaactatt 
tatgcttctc 
ctggactgtg 
acattccctg 
ttctagcctg 
tttgagatgc 
ccctattgca 
gcaaaatacc 
gcccgggaag 
actgtcctca 
cccactcatg 
cattcgtatg 
agagcatagc 
ttttgtttgt 
ttgatagccc 
taagaaatct 



93 



PATENT 
Docket 524593009100 



73321 
73381 
73441 
73501 
73561 
73621 
73681 
73741 
73801 
73861 
73921 
73981 
74041 
74101 
74161 
74221 
74281 
74341 
74401 
74461 
74521 
74581 
74641 
74701 
74761 
74821 
74881 
74941 
75001 
75061 
75121 
75181 
75241 
75301 
75361 
75421 
75481 
75541 
75601 
75661 
75721 
75781 
75841 
75901 
75961 
76021 
76081 
76141 
76201 
76261 
76321 
76381 
76441 
76501 
76561 
76621 
76681 
76741 
76801 
76861 
76921 
76981 
77041 
77101 



gtttcagatc 
caatactaag 
tcagaatgta 
gtaggagaaa 
acaagctaag 
aagaaaacaa 
gctgaaaaaa 
tccttcaaac 
cacacctgta 
ttcaagacca 
gccaggcRtg 
agaatcggta 
cagcctgggc 
aaatttaaat 
atcacagttc 
cgagtagctg 
gggttttgca 
tcgacctccc 
gtcaaaactc 
cgtgatctcg 
ctcctgagta 
cgtagagaca 
tccgcctgcc 
tgtcaaaact 
tgggaggatc 
acaccagcct 
gcagtaataa 
tgtgagtttt 
ctcttgtcaa 
ttttttggat 
gtacatgtgc 
gcacccgtta 
cccccacccc 
actgttgaat 
agtttgctga 
tcattgttta 
tctatcattg 
atagacatac 
cccagtagtg 
cacactgact 
cctatttctc 
ctaactggtg 
gatgatgagc 
tgttcatatc 
ggagttcact 
tctcccattc 
tctttagttt 
tttcagacat 
ctagggtttt 
ttgtataagc 
cccagcaccg 
caaagatcag 
gatttatatc 
atagtttgaa 
tggagatgtg 
gtgaagaaag 
agtatggcca 
ttgtttgtat 
ttcacgtccc 
gggagttcac 
gatttttgta 
agattttggg 
gacaatttga 
gccctggcca 



accttgtgtg 
agataaaact 
atgaaaacaa 
atgcaagtat 
tatctaactg 
ccatactgta 
gagttttgac 
ttaatccttt 
gtcccagcac 
gcctggccaa 
gtggtgggcg 
cagcctggga 
aacaagggca 
agataaacaa 
attgcagtgt 
ggatggcagg 
gtgttgccca 
aaagtgctgg 
ttttttggaa 
gctcactgca 
gctgggatta 
gggttttacc 
cctggcctcc 
cttaattagc 
gcttaagcct 
ggtgacagac 
aaccttctag 
gctaaatatg 
aattttacag 
taaggtgatt 
acaatgtgca 
actcgtcatt 
acaacagtcc 
tcccatctat 
gaatgatgat 
tggctgcata 
ttggacattt 
gtgtgcatgt 
ggatggctgg 
tccacaatgg 
aacatcctct 
tgagatgata 
attttttcag 
ctttgcccac 
gtagattctg 
tgtaggttgc 
aattagatcc 
gaagtccttg 
tatggtttta 
tgtaaagaag 
tttattaaat 
atagttgtag 
tctgttttgg 
gtcaggtagt 
ggttcctttt 
gcattggtag 
ttttcacaat 
cctcttttat 
ttgtaagttg 
tcatgatttg 
cattgatttt 
ctgagacaat 
cttcctcttt 
gaacttccaa 



caattcagga 
caggagtcaa 
actatgcata 
tacctttctt 
aaagagaaag 
gaaaaaacaa 
tattgattga 
aaacttcaaa 
tttgggaggc 
catggagaaa 
cctgtaatcc 
ggcggaggtt 
aaactccgtc 
ggtctcactc 
tgacctcctg 
cttattccac 
ggcttatctc 
gattacatgc 
acggagtttc 
acctccgtct 
caggcacctg 
atgtttacca 
ctaagtgctg 
ccggtttggt 
gggaggtaca 
ttgttgacag 
ccaccaagcc 
cggtttcttt 
taacagtacc 
tgttatttct 
ggttagttac 
tagcattagg 
ccagagtgtg 
gagtgagaac 
ttccaatttc 
gtattccatg 
gggttagttc 
gtctttatag 
gtcaaatggt 
ttaaactagt 
ccagcacctg 
tcccattgtg 
tgtcttttgg 
tttttgatgg 
gatattagcc 
ctgttcaccc 
catttgtcaa 
cccatgccta 
ggtctaacgt 
ggatccagtt 
agggaatcct 
atatgtggcg 
taccagtacc 
gtgatgcctc 
tggttccata 
cttgatgggg 
attgattctt 
ttccttgagc 
gattcctaag 
gctctctgtt 
gtatcctgag 
ggggttttct 
tcctaattga 
cactatgttg 



taaaattaat 
aagaagaaat 
gaaacttgtg 
gcttgtatta 
aaagaaataa 
agcccaaagt 
atgtatttaa 
aaaagtttaa 
aaaggtgggt 
ccccgtctct 
aatcccagct 
gcagtgagcc 
tcaaaaaaaa 
ttgtcttcca 
ggctcaagct 
cagctaattt 
aaactcctgg 
atgagccacc 
gcgcttgtcg 
cccgggtcca 
tcaccacgcc 
ggctagtctt 
ggattacagg 
agtgcatgcc 
ggctgcggtg 
accctgtctc 
tagagagttt 
gaattccagt 
aaaattaaaa 
ttttttttta 
atatgtatac 
tatatctcct 
atgttcccct 
atgcggtgtt 
atccatgtcc 
gtgtatatgt 
caagtctttg 
cagcatgatt 
atttgtagtt 
ttacagtccc 
ttgtttcctg 
gttttgattt 
ctgcataaat 
ggttgtttgg 
ctttgtcaga 
tgatggtagt 
ttttggcttt 
tgtcctgaat 
ttaagtcttt 
tcagctttct 
ttccccattg 
ttatttctga 
atgctgtttt 
cagctttgtt 
tgaactttaa 
atggcattga 
cctaccgatg 
agtggtttgt 
tattttattc 
tgtctgttat 
actttgctga 
atacatacag 
atacccttta 
aataggagtg 



aaagatgaat 
cagaaagatt 
gtattcggcc 
ggataaaaag 
taaaggtaac 
tgagattcaa 
aatttaaatg 
atttggctaa 
ggatcacctg 
actaaaaata 
actcgggagg 
gagatcacgc 
agtttaatac 
ggctgcagtg 
atcctcccac 
ttgtattttt 
gctcaaccga 
acacccgatt 
cccgggctgg 
agtgcttctt 
tggctaattt 
gaattcctga 
cgtgagcaac 
tgaagttcca 
agccatgatc 
aaacaaaata 
tattaatgag 
tttatacaga 
ataactaaat 
attatacttt 
atgtgccatg 
aatgctatcc 
tcctgtgtcc 
tggttttttg 
ctacaaagga 
gccacatttt 
ctattgtgaa 
tatagtcctt 
ctagatccct 
accaatagtg 
acttcttaat 
gcatttctct 
gtcttctttt 
ttttttcttg 
tgagtaggtt 
ttcttttgct 
tattgccatt 
ggtaatgtct 
aatccatctt 
acatatggct 
cttgtttttc 
ggcctctgtt 
ggttactgta 
cttttggctt 
agtagttttt 
atctataaat 
agcatggaat 
agttcttctt 
tctttgaagc 
cggtgtataa 
agttgcttat 
tcatgtcgtc 
tttccttatc 
gtgagagagg 



agtaaaaatc 
agagcataat 
aagacagcat 
ttgaaaatta 
agcagaaatt 
aaagagtatg 
gtcctttaaa 
cgcagtgact 
aggtcaggtg 
caaaaaatta 
ctaaggcagg 
cattgcactc 
ataatttaat 
cagtgacaca 
ctcagcctcc 
agtagagatg 
tctgcccatt 
tgactcaata 
agtgcggtgg 
atgcctcagc 
ttgtattttt 
ccttaggtgc 
tgcgcccagc 
gctactcagt 
atgacaccac 
aaaacaacag 
aatagtgagt 
aagtataaag 
ctcttttttt 
aagttttagg 
ctggtgtgct 
ctcccccctc 
atgtgttctc 
tccttgccat 
catgaactca 
cttaatccag 
tagtgccgca 
tgggtgtata 
gaggaatcac 
taaaactatt 
gattgccatt 
gatggccagt 
gagaagtgtc 
taaatttgtt 
gcgaaaattt 
gtgcagaagc 
gcttttggtg 
aggttttctt 
gaattaattt 
agccagtttt 
tcaggtttgt 
ctgttccatt 
gccttgtagt 
aggattgact 
ttccaattct 
taccttgggc 
gttcttccat 
gaagaggtcc 
aattgtgaat 
gaatgcttgt 
cagcttaagg 
tgcaaacagg 
ctgcctaatt 
gcatccctgt 
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77161 
77221 
77281 
77341 
77401 
77461 
77521 
77581 
77641 
77701 
77761 
77821 
77881 
77941 
78001 
78061 
78121 
78181 
78241 
78301 
78361 
78421 
78481 
78541 
78601 
78661 
78721 
78781 
78841 
78901 
78961 
79021 
79081 
79141 
79201 
79261 
79321 
79381 
79441 
79501 
79561 
79621 
79681 
79741 
79801 
79861 
79921 
79981 
80041 
80101 
80161 
80221 
80281 
80341 
80401 
80461 
80521 
80581 
80641 
80701 
80761 
80821 
80881 
80941 



cttgtgccag 
atagcaaatt 
atctcaggaa 
ttagagatgc 
tattttcttc 
atttgatcaa 
gatcatctat 
actcttggca 
gtagtaacca 
gcttcctatt 
ctaagcatga 
agatgatata 
tatgtaacat 
tttcagtgac 
tatctaggga 
aataagtaga 
ttcttcaact 
gaacttgatg 
gcaaaacttg 
tagggagatc 
gcagtccctg 
atatatttat 
aatgttctgg 
attgtatatt 
attttaaaaa 
aagaccagcc 
gacgtggtgg 
tgatccctgg 
ctgggcaaca 
ccaggcgcag 
cctgaggtca 
agtacaaaat 
aagcaggata 
tgccctccag 
aatgccagac 
gcaatatttc 
atggagtctt 
ccctccacct 
caggcacgtg 
cgtgttagcc 
aagtgctggg 
ttggaaagga 
gccacaagtt 
ctagtatctt 
aacaaaagat 
tgaagcactg 
atcttctatg 
acttcatagc 
taggcttttg 
acctatgttt 
tttttcttct 
tttgttacat 
cattaggtat 
gtgtgtgatg 
tgagaacatg 
cagcttcatc 
ttccatggtg 
ttggttccaa 
tttatagcag 
aaatggtatt 
aactagttta 
gcacctgttg 
tccgagctaa 
ttagatgaat 



ttttcaaagg 
agagacagtg 
tgcatggcag 
ccttttaaaa 
cttgctgaaa 
atatggtaat 
cttgatttgc 
gaggagaaac 
tcttactgaa 
aagaccacca 
aagaaataaa 
attacataga 
accagcaaca 
attaaataat 
tgcgtataac 
aagtaatacc 
ggtccataga 
agttagggcc 
ctttaataat 
aaaaccaaat 
gggaaagaat 
atgccattac 
aattagatgt 
tttaaagcat 
tcccagcact 
tgaccaacat 
cacgtgcctg 
aggcggaggc 
gagcgagatt 
tggctcacac 
ggagttcgag 
tagccgggtg 
atcacttgaa 
cctgggcaac 
agactgaaaa 
accacaagag 
gctctgttgc 
cccggattca 
ccaccatgaa 
aggatggcct 
attacaggcg 
tgtttcgtga 
aagattaccc 
cctggagcca 
tatggacaca 
gcatcaacca 
tgcatgtgca 
cgaaggatag 
tgcctcctgg 
cagtgcaact 
tttttttaat 
atgtatacat 
atctcttaat 
tttcccttcc 
cagtgtttgg 
catgtcccta 
tatatgtgcc 
gtctttgcta 
catgatttat 
tctagttcta 
cagtcccacc 
tttcctgact 
aagaggaagt 
ggctaactgg 



gaatgcttcc 
ggtattaaca 
ttctgtcttt 
aaacatatta 
cataaaataa 
gtaatttgcc 
gagaagcatt 
agcaaagaat 
agatcaaact 
catctactca 
aattataaga 
aaatcaagaa 
aatggtaaga 
aaggattcag 
aaaaaaatta 
atctttatgg 
caacacaatt 
aaaagtaatc 
ttgtgcaggg 
tcacacagac 
ggaaatggtg 
attaaagtca 
gatgtttgca 
ggtatgtgaa 
ttgggaggct 
gttgaaaccc 
taatctaagc 
tgccgtgagc 
ccatctcaaa 
ctgtaatccc 
accagcctga 
tggtggtgca 
cccgggaggc 
aagagcgaaa 
cttaacatgc 
ggcagtttga 
ccaggctgga 
agtgattacc 
cgactaattt 
tgatctcctg 
tgaactttga 
ctcaaaagac 
tggaatccag 
gtaacatgaa 
acttcgtggt 
caaagaaagt 
tgtgttgtgt 
tcctatatgg 
ccccggcatt 
gtaaaatgag 
tatactttaa 
atgccatgtt 
gctatccctc 
tgtgtccaag 
ttttctgtcc 
caaaggacat 
acattttctt 
ttgtgaatag 
aatcctttgg 
gatccctgag 
aacagtgtaa 
ttttaatgat 
tcgaacccat 
aataaccagt 



agtttttgcc 
attgcatata 
agaaaatgta 
caaattggaa 
tgacgtgttt 
attctaacag 
tgataatatt 
ttttaaaata 
tgttcccttt 
tctttaagct 
taaaaaggaa 
gattccatgg 
aaaacacgtt 
gaatacaacg 
taaagcttca 
aacagagtca 
ttaactaaga 
aagatattca 
atagacaaat 
atgaaaagtt 
ctagaataat 
atgcctatgg 
caactctgtg 
taatacctca 
gaggcgggcg 
catctctatt 
tactcaggtg 
cgagatgatg 
aaaaaaaaaa 
agcactttgg 
ccaacatgga 
tgcctgtaat 
ggcggttgtg 
ttccatctca 
aaagggcaag 
ccctgaactt 
gtgcagtggc 
ctgcctcagc 
tttgtgtttt 
accttgtgat 
ttttatacag 
tactggggat 
tcaaagctgg 
tatgatgacc 
atgtgtccat 
tatgtgagca 
atgttgacag 
aaacgcaata 
gccactgaat 
gataatatta 
gttctagggt 
ggtgttctgc 
ccccatcccc 
tgttctcctt 
ttgcgataat 
gaactcatcc 
aatccagtct 
tgccgcaata 
gtatataccc 
gaatcaccac 
aagtattcct 
tgccattcta 
cagaaagaag 
gtagagaagt 



cattcagtaa 
tcacaataaa 
ttttttgtat 
tattcaaagg 
taaaatttat 
atcaaaatca 
caaaatccat 
taaagggtat 
cagcttggga 
ggtgattctc 
agaaaactct 
ataggcttcc 
gttttttttt 
aatataacaa 
tttaaaacat 
gtattataaa 
ttccaattgg 
tgaggaagga 
actgtacacc 
gatttatgac 
tggtttctat 
agtttgggag 
aatgtactaa 
ataaagctgt 
gatcatgagg 
aaaagtacaa 
gctgaggcag 
cgcctggcac 
aagtaaaaaa 
gaagccgagg 
gaaaccccgt 
cccagctact 
gtgagccaag 
aaaaaaaaaa 
tattgaagac 
tggtgtgtgt 
gtgatcttgg 
ctcccgagta 
tagtagagac 
ctgcccacct 
gaagtggttg 
taactagctt 
tgttatgatt 
aaatgtttat 
ttaggcccac 
ctttctctgg 
gtcacacatc 
tgaaagtgtg 
tgtgtgactt 
gtaattacct 
acatgtgcac 
acccattaac 
ccaccccacg 
gttcaattcc 
ttgctcagaa 
ttttttatgg 
atcactgatg 
aacatacgtg 
agtaatggaa 
actgtcttcc 
gtttctccac 
acgggtgtga 
ctaaaaacct 
ccttaaatga 



acatggaaaa 
tgtatgttta 
ttttatcaat 
gcagctgttt 
agaatctttg 
gaatcagtat 
ttatgaaaaa 
gtacaaaact 
ataagacaga 
accaacacat 
cagtacttaa 
caaaaaacaa 
taaagtacat 
aaatgtaaga 
ttttaaacaa 
tgtgtcagtt 
gatttctgtg 
taacagaatg 
agtctctcaa 
agatgatgtt 
atgtatttct 
gagtgatcaa 
gaaccactga 
tataaaaaat 
tcaggagttc 
aaattagcca 
gaaaatcgct 
tccatccagc 
tcaatgacgg 
tggcagatca 
ctctactaaa 
ggcgaggctg 
aacgcgccat 
aaaaaaaatc 
caaacagata 
ttgttttgag 
ctcactgcaa 
gctgggacta 
ggggtttcac 
cagcttccca 
gaggaaatgg 
gatagcagtg 
ctagtttgag 
ttcgtctgaa 
cgtaaagaag 
atgataaatt 
agaaactgcc 
attaataata 
tgggcaagtt 
catggtattt 
aacgtgcagg 
tcgtcattta 
acaggcccca 
cacctatgag 
tgttggtttc 
ctgcatagta 
gacatttggg 
tgcatgtgtc 
tggctgggtc 
acaatggttg 
ttcctctcca 
gataaacttc 
tgaaaaaaga 
tctgatggag 
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81001 
81061 
81121 
81181 
81241 
81301 
81361 
81421 
81481 
81541 
81601 
81661 
81721 
81781 
81841 
81901 
81961 
82021 
82081 
82141 
82201 
82261 
82321 
82381 
82441 
82501 
82561 
82621 
82681 
82741 
82801 
82861 
82921 
82981 
83041 
83101 
83161 
83221 
83281 
83341 
83401 
83461 
83521 
83581 
83641 
83701 
83761 
83821 
83881 
83941 
84001 
84061 
84121 
84181 
84241 
84301 
84361 
84421 
84481 
84541 
84601 
84661 
84721 
84781 



ctgaaaatga 
caactggaag 
aagtttagag 
gtgaaaagac 
aagttggaaa 
gccaacattc 
actccaagac 
gcagccagag 
ctctcggcag 
caaaagaatt 
aaaataaaat 
tacaagagct 
aaaaacatgc 
caaagtaacc 
taaatgtaaa 
gtcaagaccc 
ggctcaaaat 
gggttgcaat 
ggccattata 
tatgcaccca 
gacttagact 
agatcaagga 
cctaatagac 
accacattgc 
atgcagaaga 
caggattaag 
gaatgactac 
tgagaacaaa 
gaaatttata 
aacatcacaa 
gcaagaaata 
aaaaatcaat 
agcaagtcta 
aggggata tc 
cctctgtgca 
cctcccaaga 
aattgaggca 
atgtaaatca 
tttatagggt 
ttttgggtct 
tttacaatat 
atattgtcaa 
acagtttcag 
gaaaccagtg 
gggtcatgaa 
ctggcaacca 
tctgcattaa 
aatgccttgg 
ctttgggata 
aagatcctca 
aacagggaag 
tttgttgtgt 
attttgagat 
tttaattgat 
ctcttaatct 
ctgttattac 
cttttgtttg 
tcaaccatca 
catcaactct 
atgttgaatg 
ccaacattat 
cttgccagta 
ccccaagctg 
gtgggggttg 



tggcacgaga 
aaagggtatc 
aaaaaagagt 
caaatctatg 
acactctgca 
aaattcagga 
acataattgt 
agaaaggtcg 
aaactctaca 
ttcaacccag 
cctttactga 
cctgaaggaa 
caaaatgtaa 
agctaacatc 
tgggctaaat 
atcagtgtgc 
aaagggaggg 
cctatctctg 
taatggtaaa 
atacaggagc 
cccacacaat 
gacagaaggt 
atctacagaa 
acttattcca 
acagaaataa 
aaactcactc 
tgggtacata 
gacacaacgt 
gcactaaatg 
ttaaaagaac 
actaagatca 
gaatccagga 
ataaagaaga 
accaccgatc 
aataaactag 
ctaaatcagg 
ataattaata 
gtttatagat 
cagaccggtg 
ttaaggatga 
aattttttta 
tgccgtattc 
agtctccact 
tgggggttgt 
accagtgaac 
taaccctttt 
ttcttaggta 
cttgtctctt 
ggtatgagag 
ttgttgttca 
acatcatgac 
atcatatggc 
ttacacagat 
tccattgatt 
tcatagctct 
agtaattaag 
gggatcctgt 
gacttagcca 
cacaagattg 
taatatgaca 
tctccttgac 
catcttaact 
tgttatatca 
cattactctg 



actacgtgac 
agtgattgaa 
aaaaagaaat 
tctgattagt 
ggatattatc 
aatacagaga 
gagattcacc 
ggttacccac 
agccagaaga 
aattttacat 
caagcaaatc 
gcactaaaca 
agaccatcaa 
ataatgacag 
gctccaatta 
tgtattcagg 
aggaagatct 
ataaaacaga 
tggatcaatt 
acccagattc 
aataatggga 
taacaaggat 
ctctccacca 
aaattgagca 
caaactgtct 
aaaaccactc 
acgaaatgaa 
accagaatct 
cccacaagag 
tagagaagca 
gagcagaact 
gctggttttt 
aaagagagaa 
ccacagaaat 
aaaatctaga 
aagaagttga 
gcctaccaat 
gagcaggagt 
aattaagtag 
ctttaacgtc 
catttttaca 
ttttaaaatt 
tggctttccc 
gctaactaca 
ctactgtgag 
ccccaacaag 
ttaatgtcaa 
tcccttgtat 
aatcattgta 
gtcattgggt 
tttttgttgg 
atgtactctt 
tgttagtgca 
gcaatgctgc 
ccgagggtcg 
cctattttgg 
tattcccatt 
ctggtgtcta 
ttacattttt 
ggagaatgat 
tgcattttca 
aggacctgca 
tgattgatcc 
ttctcacatt 



aaatgcacaa 
gatcaaacga 
gaataaagcc 
gtacctgaaa 
caagagaatt 
acaccacaaa 
aaagttgaaa 
aaagggaagc 
gagtgggggc 
ccagccaaac 
tgagagactt 
tagaaaggaa 
tgctagaaag 
gatcaaattc 
aaagacacag 
agacccatct 
accaagcaaa 
atataaacca 
caacaagaag 
ataaagcaag 
gactttaaca 
atccaggact 
caaatcaaca 
catagttgga 
ctcagaccac 
aactacatgg 
ggcagaaata 
ctgggacaca 
aaagcaggaa 
agagcaaaca 
gaaggaaata 
tgaaaagatc 
gaatcaaaca 
acaaactacc 
agaaatggat 
atccctgaat 
caaaaaaaga 
ctagtatcat 
gcacttgatg 
taccatagtc 
attatttata 
tactatattg 
cacttccgta 
aagtcagtga 
tgggatcttt 
gtggccatga 
aacagactct 
acagtaacct 
aacacttgca 
tctgaatctg 
gaagtgctct 
tgtgtctggc 
tcgatctttt 
ctgtctctct 
gaaactccat 
ttctgatggt 
gccatcatgt 
gaaccagctc 
tgaagtttta 
ccatccttat 
tgttccagtc 
gcgccttttc 
taatttttta 
gctataaaga 



gcttcagtag 
atgaaatgaa 
tccaagaaat 
gtgatgggag 
tccccaacct 
aatactcctc 
tgaaggaaaa 
ccgtcaaact 
caatattcaa 
taagcttcat 
tgtcaccacc 
caaccagtac 
aaactgcatt 
ccacataaca 
actggcaaat 
catgtgcaga 
tggaaaacaa 
acaaagatga 
agctaactat 
tccttggaga 
ccccactgtc 
tcacctctgc 
gaatatacat 
agtaaagcac 
agtgcaatca 
aaactgaaca 
aaggtgtttt 
tttcaagcag 
agatctaaaa 
cattcaaaag 
gaaacacaaa 
aacaaaattg 
gatgcaataa 
atcagagaat 
aaattcctgc 
agaccaataa 
cctcatggta 
cctttaagac 
gggatcacag 
ccaggcaata 
atatttgtct 
gctgagacag 
gctcttacac 
ccagggaaat 
gcaaggaccc 
taatgcccct 
gccaccaaca 
gagtaaatgg 
ttatagggtt 
aaaaaggtct 
caggatttta 
ttctttcatt 
tatttttgaa 
tgcccttagg 
tacttgctac 
ttagatttgc 
agcgtagttt 
tcacaagatg 
tgaaccagtt 
gtcgatatac 
tcatgtgtat 
ctccataaca 
ttgggtagcc 
aatacctgcg 



ccgatttgat 
gtgagaagag 
atgggagtat 
gaatggaacc 
agcaaggcag 
aagaagagca 
aatgttaagg 
aacagcagat 
cattcttaaa 
aagtgaagga 
aggcctgccc 
cagccactgc 
aactaacgag 
gtattaacct 
tggataaaga 
ggcacacata 
aaaaaggtag 
aaagagacaa 
cctaaatata 
cttacaaaga 
aacattagac 
accaagggga 
tcttctcagc 
tcctcagcaa 
aattagaact 
acctgctcct 
ttgaaaccaa 
tgtgtagagg 
ttgacaccct 
ctagcagaag 
aaacccttca 
atagaccact 
aaaatgataa 
actgtaaata 
acacatacac 
caggctctga 
tttttatgga 
atatctggtt 
tgcctgcaac 
ttgtttttaa 
tttatgtaca 
gatgagaggg 
cattggttaa 
aatcactgat 
catgtatcat 
ttgaattaat 
aactcctcaa 
ccacaggtct 
tttctcacgc 
caagttctga 
gtcatctatc 
aagtacaatg 
tagcatccca 
gactcatgtt 
tcagactttg 
caccaggcac 
ggtaccaaca 
gatttattct 
gttacccaag 
tctcccttat 
tttctcagct 
gacccagcac 
agaatgtgag 
attaggtaat 
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84841 
84901 
84961 
85021 
85081 
85141 
85201 
85261 
85321 
85381 
85441 
85501 
85561 
85621 
85681 
85741 
85801 
85861 
85921 
85981 
86041 
86101 
86161 
86221 
86281 
86341 
86401 
86461 
86521 
86581 
86641 
86701 
86761 
86821 
86881 
86941 
87001 
87061 
87121 
87181 
87241 
87301 
87361 
87421 
87481 
87541 
87601 
87661 
87721 
87781 
87841 
87901 
87961 
88021 
88081 
88141 
88201 
88261 
88321 
88381 
88441 
88501 
88561 
88621 



tgataaaaga 
caagtgcttg 
ggagcaggca 
atgaccagat 
taagccattc 
caacattggg 
tgtgtatcct 
cttggcatca 
ctgcttgtgc 
ccaagaggac 
cttgatgaag 
cttggaatta 
agcatctgct 
acccttagcc 
ttccactttt 
tgtttgtatg 
gagacagggt 
caacctcagc 
tcaagacaac 
cttccttcta 
agaattttat 
tttcattaag 
tgttgaatag 
atggacattt 
atatacaagt 
atggctggga 
agtcattgta 
cttcaccagc 
actcattgta 
gcacctattt 
cacaatcaag 
ttctatcttc 
ttctgtcatt 
tatatgtatc 
cactgtatgg 
ttttgccaaa 
aagctttaat 
tgtatatagt 
gaaataaaaa 
cttcaagaaa 
aactccaaga 
agtttattgt 
aaatttatct 
aaatatgaaa 
atataaattt 
ttataactgg 
atatactgta 
aaaaattaat 
ctgcaggatc 
tataaccaga 
tgggaacaca 
attggaattt 
gacttttatg 
atattgctta 
aaaacatttc 
gaggcaggca 
ctccatctct 
acataactct 
cattaattat 
aattacaatt 
tatccataat 
cttccaaaga 
ttaagttgca 
caagtaatca 



ggtttaattg 
gcttctgggg 
tatcacatgg 
cttacaagag 
atgagaaatc 
gattttttat 
gggcagaggt 
tcttccagga 
cagcccggag 
ccattccaag 
ccaacttgcc 
aaggctgaat 
gccaaagagg 
tttcatttcc 
tttgtttaca 
tatgtgtgtg 
ctcgctttgc 
cttccacatt 
aaacatttca 
tccccaggta 
ataaatggaa 
tataatgatt 
tgtatcccat 
gggctatttc 
ctttgtgtgg 
tatttggcaa 
ctcttttgca 
agtgggcaaa 
gttttttcat 
aaactgtaca 
atgataaata 
tctcccctcc 
agtttgcatt 
aataggtcat 
ctaaaccatt 
cacaaataaa 
ttctcttagg 
atttaacttt 
tgagaaacaa 
agggagcact 
taaaaggcat 
caaaaatgat 
gctctatttt 
atgcaaaacc 
attacttaaa 
acagataata 
gtgattagat 
actctgattt 
ttttttagct 
ccatcatatc 
actgaacagt 
tggaaaactt 
atatatttat 
atgaaaccca 
ttgggagggc 
gattacttga 
acttaaacaa 
tctgagagta 
atttatattt 
ttcttctgcc 
cttaaaatta 
aaaaaaaatc 
tcctcacata 
aagagtacag 



gctcacgttt 
aggcctcagg 
ccagagcagg 
ctcactcact 
tgccaccatg 
atgagatttg 
gggtgggtgg 
aatgggtaga 
gaatcagggg 
tcttaagatc 
tgagttgaga 
ctgatcctca 
ccttctagct 
cccccaaagt 
gatttattga 
cagtgcacaa 
cacacaggct 
tggtaagttt 
tcaaccccaa 
actcctgatt 
taatacagca 
ttgagattta 
ttaatagatc 
caggtttggg 
acatgttccc 
aagtgtttta 
ttctcatcag 
gtcttttaac 
ttcattttta 
atttgatgag 
tattcatcac 
ttgctcctga 
ttctagagtt 
acctttttgt 
attagtttac 
gtggctatga 
taaataccta 
tttttataac 
aataaaacaa 
ttacaatgag 
tttgacaaca 
gtatgtccat 
taaatatttt 
ttttattgct 
caaaaatctt 
attgtacata 
cagggtaatt 
agtaagtgaa 
ttgagagcca 
attttatctc 
attgtttact 
gtatcctccc 
ggtgagacca 
tcaataccaa 
atggtggctc 
ggccagaagt 
aaaaacagaa 
aaagaggtgc 
ttcagatccc 
agagccaaaa 
ttgttctcag 
caataagaga 
gaatgctctc 
taaatgccta 



ctgcaggctg 
aaacttacaa 
agcaagaaaa 
gttgtgatga 
atccaatcac 
ggtggggaca 
ggggtggcat 
gctaaccttt 
aagtgcgatt 
acagtccttc 
atcatatatt 
gttttttctg 
ttcacacttt 
attcattgat 
agtttaattg 
tttgctaagt 
ggagtgcaat 
tgacatRtat 
aagtttcctc 
tgctttctga 
tgtattcttt 
tgtatctcgt 
tatcaaatgt 
ctattatgaa 
atatttcttg 
agttttcaag 
cagtgtagga 
tttagccatt 
aattgaggta 
tttgacatat 
cctcaaaagt 
ttccctaccc 
ttatacaagt 
tacttacatc 
ctgttgacag 
acatttacat 
aggagtagaa 
actgtaaaac 
atgtatacca 
tgagaaaata 
gatatttgga 
ctataaaact 
caaaaagtta 
tccaagagta 
cctttttagc 
ttcatggggt 
aacatatctt 
gctaactctt 
ttaacaccaa 
ataaaatgaa 
agtgaaaaaa 
tatgagcttg 
gtgataatat 
cccaaatttt 
atgcctataa 
tcaagaccac 
ttcttgcaat 
attttttgtt 
aaattggcaa 
atgtaagaaa 
cacaatgaag 
ttaaatatgt 
tcagacactg 
ttcttcctta 



tacaggaagc 
ccatggcaga 
tgagggggag 
cagtaccaag 
ctctgaccat 
cacatccaaa 
ttatcttgca 
aagagaggga 
caagattttc 
ccatcaatac 
tctagagctt 
cctccagact 
gccttcagtc 
agcatttagc 
atgacaataa 
ttttgtttgt 
ggcacagtca 
ctatgaaacc 
ctgctccttt 
taggttagtt 
tttttttttg 
tggtgtatca 
gtttatccat 
taaagctgct 
aataaatatg 
aaagtgccca 
gagttccagc 
ctagtgggtg 
caattgacat 
gtatatacca 
tttctcatac 
cagataacca 
ggaatctaca 
tttgttactg 
acatttgggt 
acaagtattt 
tggctggatc 
tgtttcatag 
aaacatgttt 
tatatccaca 
aatatttcca 
ctcctatctg 
gttttaaacc 
atttattgac 
tcctcttttt 
acatagcgat 
tttaattggt 
ccatttgaat 
atattgaagt 
gcaagtattc 
aattgagttt 
acagcttccc 
taacaattgt 
ataaaaacaa 
tcccagcact 
cctggccaac 
ataggtgacc 
agagttatcc 
aagttggaga 
actcctagat 
aaattattga 
attgtaattt 
gcttattcta 
gaacactgct 



acagtgctaa 
aggtgaaggg 
atgcttttaa 
acgatggtgc 
ggcctacctc 
gtatatcagg 
aagtagagga 
gaggagaagg 
agcatattaa 
cctccctgac 
gttcactact 
acaagagatg 
gatgattaat 
aatgatcata 
actacatatg 
ttttgttttt 
tagctcactg 
attatcaaaa 
tagttttatc 
tgcattttct 
gtctggctac 
atctttttat 
tgatctgttg 
atgaatattc 
tagaagtgga 
tctattttcc 
tgccccacac 
tataatggta 
gcaatgaact 
atgaaaccaa 
ccctatgaaa 
ctaatcttcc 
gtatgtactt 
tgacatattt 
tgtttctaat 
gtatggactt 
atataatagg 
tggttgtatg 
aatgtttcac 
ttttgaaaaa 
tcgttacaag 
ttgctaaaac 
tatttgttta 
atcgaagaaa 
tgttttgttt 
gtttcaatac 
gaatggagtg 
ccatacttct 
aaactaaact 
ctgtcacttc 
tcaagtgaaa 
aatatttaga 
tatttttgaa 
agcatattta 
ttgggaggcc 
atggtgaaac 
ctcataaatc 
agacttttta 
atttttctta 
accatggtta 
gaacaagtcc 
tagaagacag 
aatgcgtcag 
ggaacagatt 
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88681 
88741 
88801 
88861 
88921 
88981 
89041 
89101 
89161 
89221 
89281 
89341 
89401 
89461 
89521 
89581 
89641 
89701 
89761 
89821 
89881 
89941 
90001 
90061 
90121 
90181 
90241 
90301 
90361 
90421 
90481 
90541 
90601 
90661 
90721 
90781 
90841 
90901 
90961 
91021 
91081 
91141 
91201 
91261 
91321 
91381 
91441 
91501 
91561 
91621 
91681 
91741 



tccccctaac 
gatttgtact 
tgagatttaa 
gcaaagcaag 
tgactcagca 
ataccagtag 
cttactaagc 
aatcatcctt 
aggctctggg 
tcccatcatc 
aatacttact 
atattaataa 
taattggttt 
ggagtgcagt 
ggtacaccac 
tcaccaggct 
caagcaattc 
ttcagctaat 
tcaatctctt 
gtgagccacc 
atgttgccca 
aaagtgcgga 
cctcacaaca 
ccaaaagtca 
gaggtgatta 
ttcattattg 
ctctgactct 
accttcatca 
ataaatttct 
cagactaaga 
cagaagttaa 
aggcacttcc 
aaatcagaaa 
aagatttata 
tcccagcact 
cctggccaac 
ggtgcatgcc 
ggaagcggag 
cgagactctg 
gactcttctt 
ctatattttg 
cagcacagtg 
cagcagatat 
acatcagaca 
tattccatca 
aacaataaaa 
gcttagatat 
cttttttttg 
ggctctctgc 
aggcgtgtgc 
gtgttgacca 
aaagtgctgg 



cctggctaag 
tattctaata 
gtgcaatgtc 
ggaattgttg 
ttgcatttta 
cattggcttt 
aaactcctga 
gaaaaactct 
tttgaacctt 
actctgtgtc 
aaatccttta 
taaatataaa 
attttattat 
ggcactcagg 
catgcccagc 
ggagtgcagt 
tcctgcctca 
ttttgtattt 
gatcttgtga 
acacctggcc 
ggctgctctt 
gattacaggt 
tccctgtagt 
tgttgaaatt 
ggccatgaga 
caggacaggg 
caaggtctct 
gatgctggta 
attcccttaa 
cagtatccct 
ttgtccattg 
caatatgtca 
catcatacaa 
gaattaataa 
ttgggaggcc 
atggcaaaac 
tgtagtccca 
gttgcagtga 
tctcaaaaaa 
gaacaaagcc 
ttttgttgcc 
gcctgaaaaa 
ttattgcagc 
aggttcttgc 
ataaatacat 
caggatgatt 
agttaaatgg 
agtcagagtc 
aacctctgcc 
caccatgccc 
gcctggtctc 
gattacaggc 



gctctcttta 
acagttgatc 
agcttttttt 
ccaagtagag 
gtctcatgac 
aggtttagta 
ttattgctgc 
ttactatatg 
ggctatacca 
tcagttacca 
catttattat 
ggatttagtc 
ttatttattt 
tgattctcct 
taattttttt 
ggcatgatct 
gcctcccaag 
ttagtagaga 
tctgcccacc 
tgctaatttt 
gaactcctga 
gtgagtcacc 
tattgtgggg 
taattgccat 
gctctcataa 
ttcctcataa 
tgcttttcca 
ccttgatctt 
aattacccag 
aagggtaaat 
acccagagcc 
ttctaaatct 
gctcagtctc 
ataagtgtat 
gaggtgggtg 
cctgtctcca 
gctactcagg 
gccaagatcc 
aaaaaataaa 
tgcattattt 
ttaatatgat 
ttgaggatac 
ccctattgta 
tcttatagaa 
aaaaaagaca 
ggatacagga 
aaaggcttgt 
tccctctgtt 
tcctgggttc 
aggtaatttt 
gaactcctga 
atgagccact 



tttattagat 
cataggaatt 
gttttttttt 
cacactatgc 
tttgctggga 
gccatgagtt 
aagaacttct 
gaaaacgtaa 
ccacttacac 
tatatacaaa 
gaatataaac 
tggtactaaa 
agacagagtt 
gcctcagcct 
tttttttttt 
cggctcactg 
tagctgggac 
cggggtttca 
ttggcctccc 
taatattttt 
gctcaagtga 
atgccagacc 
taggtatctg 
tgtaacagta 
atgtattaac 
aaggacgagt 
ccttctgcta 
gaactttccc 
tctgtagtat 
tgagtttatt 
tgtgtgtggc 
acaaatttca 
ttacgtggtc 
gaggctgggc 
gatcacctga 
ctaaaaatac 
aggctgaggc 
caccactgca 
ttaattaaaa 
aattgttgag 
aataggacag 
tagaaaagtt 
tgcaggcact 
atgacattat 
atttcagata 
gatgggaaga 
ctttggaggt 
gcccaggctg 
aagcgattct 
tgtatttttt 
ccgcaaatga 
gcacctggcc 



ggattaaact 
cagaaaggct 
cttttttcag 
agaacctggg 
atcagatgca 
gagtttagga 
tggtcaaagc 
agaacaatgg 
atcatgtgac 
ataacaataa 
cacatgatcc 
tgaccagtaa 
tcactcccat 
cccaagtagc 
tgagatggag 
caacctccac 
tacaggcgtg 
ccatattgtc 
aaagtgctgg 
tgtagacaca 
tctgcctgcc 
agtttgcttt 
ctatggtttg 
ctaagaggtg 
gacattattg 
ttggcacccc 
tgggatgtca 
acctgcagaa 
tctgttatag 
tctgagagta 
agttcctgtg 
tttatgtctt 
ttagacttcg 
gcaatggctc 
ggtcaggagt 
aaaaattagc 
aggagaattg 
ctccagtctg 
agtgtgtgat 
tttatttaat 
agacagtatt 
cttttatata 
gctgtgatac 
agagagagga 
gtgtaagtaa 
gatggtgaga 
ggcatatgag 
gagtgcagtg 
catgcctcag 
tgtagagaca 
tcagcccacc 
ctcaatactt 



taattccctt 
ttttcagact 
tttggctaaa 
acatatagga 
gctatcgaga 
aaggcagaat 
attcttgaca 
gtgagagcac 
cgtggttaag 
catactgcta 
tatgtggctt 
atgttattta 
tgcctaggct 
agggaccaca 
tctcactctg 
ctcctgggtt 
cgccaccatg 
caggatggtc 
gataataggc 
cggtttcacc 
ttggcctccc 
atttcttaag 
aatgtccact 
gaacctttaa 
tgggagtggg 
atttgtttct 
taacaagaag 
gtatgaggac 
cagcacaaat 
aactgaggct 
attccaaaca 
gtaagtgtag 
ccaaattaag 
atgcctgtaa 
ttgagaccag 
cagtagtggt 
cttgaacccg 
ggcaacagaa 
attgaacttt 
tttctgttga 
gtagagagtt 
ttaatctatc 
agcaatggat 
gaagaacatt 
cagctaagaa 
agtggggaag 
ctcagtactt 
gcaYgatctt 
ctgggattac 
gggtttcgcc 
tcagcctccc 
tgatcattag 



[0241] The following is a human cDNA sequence for TM7SF3. 
TM7SF3 cDNA Sequence (SEP ID NO: 2) 

NM_016551 Homo sapiens transmembrane 7 superfamily member 3 (TM7SF3), mRNA 

1 ccagccctgg cgtgggccca gcccggccca ggcagcaatg gggttcctgc agctgctggt 
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61 cgtagcggtg ctggcatccg aacaccgggt ggctggtgca gccgaggtct tcgggaattc 

121 cagcgagggt cttattgaat tttctgtggg gaaatttaga tacttcgagc tcaataggcc 

181 ctttccagag gaagctattt tgcatgatat ttcaagcaat gtgacttttc ttattttcca 

241 aatacactca cagtatcaga atacaactgt ttccttttct ccgactctcc tttccaattc 

301 ctcggaaaca ggcactgcca gtggactggt tttcatcctt agaccagagc agagtacatg 

361 cacttggtac ttggggactt caggcataca gcctgtccag aatatggcta tcctactctc 

421 ctactcagaa agagatcctg tccctggagg ctgtaatttg gagttcgatt tagatattga 

481 tcccaacatt tacttggagt ataatttctt tgaaacgact atcaagtttg ccccagcaaa 

541 cctaggctat gcgagaggcg tagatccccc accatgtgac gctgggacag accaggactc 

601 caggtggagg ttgcagtatg atgtctatca gtattttctg cctgagaatg acctcactga 

661 ggagatgttg ctgaagcatc tgcagaggat ggtcagtgtg ccccaggtga aggccagtgc 

721 tctcaaggtg gttaccctaa cagctaatga taagacaagt gtttccttct cctccctccc 

781 gggacaaggt gtcatataca atgtcattgt ttgggacccg tttctaaata catctgctgc 

841 ctacattcct gctcacacat acgcttgcag ctttgaggca ggagagggta gttgtgcttc 

901 cctaggaaga gtgtcttcca aagtgttctt cactcttttt gccctgcttg gtttcttcat 

961 ttgtttcttt ggacacagat tctggaaaac agaattattc ttcataggct ttatcatcat 

1021 gggattcttc ttttatatac tgattacaag actgacacct atcaagtatg atgtgaatct 

1081 gattctgaca gctgtcactg gaagcgtcgg tggaatgttc ttggtagctg tgtggtggcg 

1141 atttggaatc ctctcgatct gcatgctctg tgttggacta gtgctggggt tcctcatctc 

1201 gtcagtgact ttctttactc cactgggaaa cctaaagatt tttcatgatg atggtgtatt 

1261 ctgggtcact ttctcttgca tagctatcct cattccagta gttttcatgg gctgcctaag 

1321 aatactgaac atactgactt gtggagtcat tggctcctat tcggtggttt tagccattga 

1381 cagttactgg tccacaagcc tttcctacat cactttgaac gtactcaaga gagcgctcaa 

1441 caaggatttc cacagagctt tcacaaatgt gccttttcaa actaatgact tcattatcct 

1501 ggcagtatgg ggcatgctgg ctgtaagtgg aattacgtta cagattcgaa gagagagagg 

1561 acgaccgttc ttccctcccc acccatacaa gttatggaag caagagagag agcgccgagt 

1621 gacaaacatt ctggacccta gctaccacat tcctccattg agagagaggc tctatggccg 

1681 attaacccag attaaagggc tcttccagaa ggagcagcca gctggagaga gaacgccttt 

1741 gcttctgtag atgcccaggg gcttggtcag tgtgcctcag ctttggagtt catgcctgga 

1801 gtggttcaac agtctctggt gcaagtctaa taagagatca ggcatatata tctgttcttt 

1861 gcataatatt atggtgccct tattgatata tggtaagggt gtactagggg attaggatga 

1921 ttgtaagaga atgagaaaga tgaccaaaag gttggtggta gggaggcttt ttcttatttc 

1981 caaatacttg agaaattacc ttttggttta caaatctatg atcaacttat tccattaaat 

2041 agatacatta aaaaaattaa aaactgcaaa aaaaaaaaaa aaactggtgt ttctttttat 

2101 aaccccttga aacaagtctc tcacctgagc ctgtctaaac tttcggaggg agtttattat 

2161 tgagtcttta tctgtgacag tatttggaga tttagggatt tgatacttag gcctttgaat 

2221 tttagaatac aaaaagagaa gcaagccaga catggtggct cacacctgta atcccaatac 

2281 tgggaagcca aggtgggagt atcgcttgag cccaggagtt tgagaccgac atgggcaaca 

2341 tgacaagacc ccatctctac aaaaaaaatt aaaaaattag ccaggcatgg tggcacatgc 

2401 ctactcccag ctcccaagga gactgagatg ggaggatccc tggagccctg aagattgagg 

24 61 ctacagtgag ccttgattgt gtcactgcac tccagcttgg gtgacagaga ccctgtctcg 
2521 agaaatt 



[0242] The following is a human polypeptide sequences for TM7SF3. 
TM7SF3 Amino Acid Sequence (SEP ID NO: 3) 

NP057635 transmembrane 7 superfamily member 3; seven transmembrane protein TM7SF3 [Homo 
sapiens] 

MGFLQLLVVAVLASEHRVAGAAEVFGNSSEGLIEFSVGKFRYFELNRPFPEEAILHDISS 

NVTFLIFQIHSQYQNTTVSFSPTLLSNSSETGTASGLVFILRPEQSTCTWYLGTSGIQPVQN 

MAILLSYSEPvDPVPGGCNLEFDLDIDPNrYLEYNFFETTIKFAPANLGYARGVDPPPCDA 

GTDQDSRWRLQYDVYQYFLPENDLTEEMLLKHLQRMVSVPQVKASALKVVTLTAND 

KTSVSFSSLPGQGVrYNVIVWDPFLNTSAAYEPAHTYACSFEAGEGSCASLGRVSSKVFF 

TLFALLGFFICFFGHRFWKTELFFIGFIIMGFFFYILITRLTPIKYDVNLILTAVTGSVGGMF 

LVAVWWRFGILSICMLCVGLVLGFLISSVTFFTPLGNLKIFHDDGVFWVTFSCIAILIPVV 
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FMGCLRILNILTCGVIGSYSWLAIDSYWSTSLSYITLNVLKRALNKDFHRAFTNVPFQT 

NDFIILAVWGMLAVSGITLQIRRERGRPFFPPHPYKLWKQERERRVTNILDPSYHIPPLRE 

RLYGRLTQIKGLFQKEQPAGERTPLLL 

[0243] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
within the scope and spirit of the invention, as set forth in the aspects which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0244] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 
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What is claimed is : 

1 . A method for identifying a subject at risk of osteoarthritis, which comprises detecting the 
presence or absence of one or more polymorphic variations associated with osteoarthritis in a nucleic acid 
sample from a subject, wherein the one or more polymorphic variations are detected in a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby the presence of the polymorphic variation is indicative of the subject being at risk of 
osteoarthritis. 

2. The method of claim 1 , which further comprises obtaining the nucleic acid sample from the 

subject. 

3. The method of claim 1, wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 27052000 to 27066000 in human genomic DNA. 

4. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions selected from the group consisting of 230, 231,5330, 6334, 11372, 11456, 11501, 
13393, 16666, 17596, 19710, 19800, 20297, 20967, 32514, 33159, 37600, 41259, 41329, 50060, 53292, 
53393, 56417, 56435, 58847, 59595, 59661, 60355, 60407, 62357, 68230, 68516, 69055, 72603, 73928, 
85897 and 91554. 

5. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 1 selected from the group consisting of 56435, 59595, 53292, 33159 
and 41329. 

6. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in linkage disequilibrium with one or more positions in claim 3, 4 or 5. 
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7. The method of claim 1 , wherein detecting the presence or absence of the one or more 
polymorphic variations comprises: 

hybridizing an oligonucleotide to the nucleic acid sample, wherein the oligonucleotide is 
complementary to a nucleotide sequence in the nucleic acid and hybridizes to a region adjacent to the 
polymorphic variation; 

extending the oligonucleotide in the presence of one or more nucleotides, yielding extension 
products; and 

detecting the presence or absence of a polymorphic variation in the extension products. 

8. The method of claim 1 , wherein the subject is a human. 

9. The method of claim 8, wherein the subject is a human female. 

10. The method of claim 8, wherein the subject is a human male. 

1 1 . A method for identifying a polymorphic variation associated with osteoarthritis proximal to 
an incident polymorphic variation associated with osteoarthritis, which comprises: 

identifying a polymorphic variation proximal to the incident polymorphic variation associated 
with osteoarthritis, wherein the polymorphic variation is detected in a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 

variation; 

determining the presence or absence of an association of the proximal polymorphic variant with 
osteoarthritis. 

12. The method of claim 11, wherein the incident polymorphic variation is at one or more 
positions in claim 3, 4 or 5. 
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13. The method of claim 11, wherein the proximal polymorphic variation is within a region 
between about 5 kb 5 ' of the incident polymorphic variation and about 5 kb 3 ' of the incident 
polymorphic variation. 

14. The method of claim 11, which further comprises determining whether the proximal 
polymorphic variation is in linkage disequilibrium with the incident polymorphic variation. 

15. The method of claim 11, which further comprises identifying a second polymorphic variation 
proximal to the identified proximal polymorphic variation associated with osteoarthritis and detennining 
if the second proximal polymorphic variation is associated with osteoarthritis. 

16. The method of claim 15, wherein the second proximal polymorphic variant is within a region 
between about 5 kb 5 ' of the incident polymorphic variation and about 5 kb 3 ' of the proximal 
polymorphic variation associated with osteoarthritis. 

1 7. An isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 

of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ED NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
wherein the nucleotide sequence comprises a polymorphic variation associated with osteoarthritis 

selected from the group consisting of a thymine thymine repeat at position 56435, a thymine at position 
59595, a cytosine at position 53292, a guanine at position 33159 and a thymine at position 41329. 

18. An oligonucleotide comprising a nucleotide sequence complementary to a portion of the 
nucleotide sequence of (a), (b), (c), or (d) in claim 1 7, wherein the 3 ' end of the oligonucleotide is 
adjacent to a polymorphic variation associated with osteoarthritis. 

19. A microarray comprising an isolated nucleic acid of claim 17 linked to a solid support. 
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20. An isolated polypeptide encoded by the isolated nucleic acid sequence of claim 17. 

21 . A method for identifying a candidate therapeutic for treating osteoarthritis, which comprises: 

(a) introducing a test molecule to a system which comprises a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of: 

(i) a nucleotide sequence in SEQ ID NO: 1-2; 

(ii) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide 
sequence in SEQ ID NO: 1-2; 

(iii) a nucleotide sequence which encodes a polypeptide that is 90% or more identical 
to the amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(iv) a fragment of a nucleotide sequence of (a), (b), or (c); or 

introducing a test molecule to a system which comprises a protein encoded by a nucleotide 
sequence of (i), (ii), (iii), or (iv); and 

(b) determining the presence or absence of an interaction between the test molecule and the 
nucleic acid or protein, 

whereby the presence of an interaction between the test molecule and the nucleic acid or protein 
identifies the test molecule as a candidate therapeutic for treating osteoarthritis. 

22. The method of claim 21, wherein the system is an animal. 

23. The method of claim 21, wherein the system is a cell. 

24. The method of claim 2 1 , wherein the nucleotide sequence comprises one or more 
polymorphic variations associated with osteoarthritis. 

25. The method of claim 24, wherein the one or more polymorphic variations associated with 
osteoarthritis are at one or more positions in claim 3, 4 or 5. 

26. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a nucleic acid, wherein the nucleic acid comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ED NO: 1-2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 
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(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
whereby contacting the one or more cells of the subject with the nucleic acid treats the 

osteoarthritis in the subject. 

27. The method of claim 26, wherein the nucleic acid is RNA or PNA. 

28. The method of claim 27, wherein the nucleic acid is duplex RNA. 

29. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a protein, wherein the protein is encoded by a nucleotide sequence 
which comprises a polynucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby contacting the one or more cells of the subject with the protein treats the osteoarthritis 
in the subject. 

30. A method for treating osteoarthritis in a subject, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the one or more polymorphic variation are 
detected in a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 
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administering an osteoarthritis treatment to a subject in need thereof based upon the presence or 
absence of the one or more polymorphic variations in the nucleic acid sample. 

3 1 . The method of claim 30, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

32. The method of claim 30, wherein the treatment is selected from the group consisting of 
administering a corticosteroid, a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAJD), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 

33. A method for detecting or preventing osteoarthritis in a subject, which comprises: 
detecting the presence or absence of one or more polymorphic variations associated with 

osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

administering an osteoarthritis prevention or detection procedure to a subject in need thereof 
based upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

34. The method of claim 33, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

35. The method of claim 33, wherein the osteoarthritis prevention is selected from the group 
consisting of administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAID), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 
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36. A method of targeting information for preventing or treating osteoarthritis to a subject in 
need thereof, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-2; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-2; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-2; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

directing information for preventing or treating osteoarthritis to a subject in need thereof based 
upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

37. The method of claim 36, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

38. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and an antibody that specifically binds to a protein, polypeptide or peptide encoded by a 
nucleotide sequence identical to or 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-2. 

39. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and a RNA, DNA, PNA or ribozyme molecule comprising a nucleotide sequence identical 
to or 90% or more identical to a portion of a nucleotide sequence in SEQ ID NO: 1-2. 

40. The composition of claim 39, wherein the RNA molecule is a short inhibitory RNA 
molecule. 
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Abstract of the Disclosure 



Provided herein are methods for identifying a risk of osteoarthritis in a subject, reagents and kits 
for carrying out the methods, methods for identifying candidate therapeutics for treating osteoarthritis, 
and therapeutic and preventative methods applicable to osteoarthritis. These embodiments are based 
upon an analysis of polymorphic variations in nucleotide sequences within the human genome. 
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